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COMPUTER-BASED STRATEGY FOR PEPTIDE AND PROTEIN 
CONFORMATIONAL ENSEMBLE ENUMESAIION AN© LIG ANB AFFINITY 

ANALYSIS 

BACISGEOSJNS OF THE M^MTIOM 
pOSil This appii^MiDH claims priority to 0,S. Pro^sloital i^lieatiorx No. 

60/275,144, which was filed m Mmck 12; 2QfI ... 

|O©02] The IIS, Govern rights m the invention by virtue of a 

grant from DAMFA> 

L Field of -ifife iB^eMloB 

|0003] The invention generally relates to the field of structural biology. It concerns a 
ma&od of modeling the slmctara of a pepiid^ a^^ oi that peptide by 

the insertion of an ajnino add not Batiirally foimd in that position in the peptide. It also 
concerns a Blet&dd Ibr ' assessiisg the l>|m<feig affinity of a peptide to a template molecule and a 
method for determining the rate of loop closure hi a peptide via a disulfide bond. 
It Related Art 

|O1>04] The |>r6teiri modeling approach ^ 
method of precli^tmg wte or other iii a peptide m order to 

stabilize ffie peptide. 

pMS] The i^^Kiimeitsional straetoe of prated 
of ways. One of the most well tooTO way of de^ermh^g 

of fee te^ Using this teeimicpe^ it is possible to elucidate the 

three-dDBaisional struchire with goo^^j^ structere may be 

detemiined €2Mogh: : &e use of the techrnqnes of Benson difitaeti o% or by endear magnetic 
resonance (HMR). 

[G0§.6J : The fhree-dtoeosioBal structure of many proteins may be cfeaeteri^ed as 
having hitenial sBrfeees (directed away fern the aqueous environment in which fee protein is 
normally found) and external surfaces (which are exposed to the aqneous envirormi^ot). 
Through the study of many namral protems, researchers have discovered that h^dro 
residues (such as tryptophan^ p^ or methionine) are 

most feeqnentiy feimd osi the Is^^ M contrast, hydro|>Mlic 

residues (snch as aspartate, asparagme, giotamate, glnianhBe, lysine, arginiiie, serisie, and 
thrwmae) are most fteqneally found on the external protein surfaces. The amino acids 
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alanine, cysteine, glyciise > histidine^ proline,, sedae 9 tyrosine, Md ihreoBlite .are aacoimtered 
with more Bearly equal aad external protein surfeces. 

P&07] The feiologi^al properties of proteins depmd directly cm ie proteins three- 
dimensional 0&$ conformation. The.: doMoi^atioB. deiermmps &® activity of enzym^ the 
capacity and Bpeciiiexty W %mMn$ prDfems, and the strtifitoral aitrihistes of receptor 
molecules, Eaok protein lias m astronomical ximaiber of possible coiifemiatiaes (about 1016 
for a small proteio of 100 residues, and ttoe has been bo reliable method pioMiig the one 
coofbrina^osi that prsdominates in aqueous solHtiosi. A seoornl difficulty is feat tee are no 
accurate and reliable force la^s fer the ister&otfeii of one part of a protein witli . another part 
and with water. These and other factors have coi^feiited to the enomxons complexity of 
detemiiBiiig th .e iriost probaW© relative location of each residua in a known |^tem 

ISfiOil The proten folding pt^^m^ tkm prohiem of detennmmg a proteirfs three- 
dliBeBsional t^ary struct^ fe seqEen&e, was first fyrm^k^d mom ^m 

half a century ago. Early ofeser^mtions a*id later experiments have lead to the contemporary 
>view feat protein cmiitoistion is determined solely by fee amino acid science md thaA 
there exists a iinique Bative-c0i3foma1ion ill wMekresidil^ #st^nt to sequence htst f^oxitiiate 
in space engender a olose-paei^ residues As a mmki of fhe 

revolution ib molecular biology , tfee nni^ of known proteixi seejaences is about 50 times 
greater tbnn the number of known tfeee-dimensioiial.: proteto steipftires, This disparity binders 
progress in many ^eas of^ a protein sequence has little nieamsg outside 

the context of the llM^-diinensxoiml stmet&m 

P§S9J The protein modeMiig.^proaeli^ of flie. pt^s^iit iweiitioxi pim'Mes m efficient 
method of predicting where to ms^^ other amino adds fe a peptide, m ordsr to 

stabilize the peptide.;. The present invention provides 3& e^semhle-hased all-atom nwSio4 
imni-protein modeling program. (MPMOD), to stabilize a. protein, to provide higher ai&ity 
binding, 

OF THE IJWSmTIOH 
f@0t 6} Thereiore, it is an oty active of the praseat inwntioB to provide a method to 

generate aad analyze «^ and predict the afinity of 

a gW€3i eon&imation of the for a target intern 

10011] An embodiment of the preseM imveMion is a computer-assisted metfeod fcr use 
In.modilyhig.;a : ;pr0tem eoraprcsing th^ steps of: pipmtimg a peptide sequence ami parameters 
for analysis into a eoi^uter«assisted inodeitog program; rmidomiy genetathig I;/, m; 
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generating a peplide backbone; perfoxraing a ^asa 4m Wmh check of the baekfeons; 
caloilatijvg & solyetit accessible surface based OEetgy of all cosibnBers; modeling the 
disBlfide(s); perfbrmmg s van d^ i^^^ eacii roian^ 

solvent aooessibfe^^ of all eonfbrmers. In a further anifeodime&i, fee 

protem may coisipise a feptiSe. & yet a fbxt!i^ 1^-'-s^^<^\ms^. comprise 

perlbmimg a binding test for each corsfeirmer with, a template molecmle. In. a fbrtber 
embodiment, the method may £02^^ bend loop closure 

imtti J^rn&m m&Q<3m^ of the mventiom is a jixethorf of protein mmiatpti2^Moa 
con^rising iBoefeliBg a protein to have Hie necessary active site con&miaMon t%$mg. the 
method above wMle x^ncing the total mimfeer -of amino acids in tfee proteiii. 

PDI3| Yet ano&^ between a 

potem and a template rpMeenie by decreasing the ec^rmatioBal entropy loss tsp on bindmg 
by tte protein comprising the centralist of at least one loop of an unstable region of trie 

1 001 4] i^oiiar aanbod is & eoMpute-assisted method 

fe ixs« In modifying a protein oomprisiiig tfe^ steps cjf: : 3?^!^tttSB^.;-a -^egp*id»" ^q^teao^b. into a 
eonipi^er-assisted niMeliBg pmgrmj fepi^ttiag parameters lor analysis into a 
computer, assisted modeling program; gaierating 4, \\k m angles randomly m allowed region 
of MamaohisBdraii maps; assigning aiagles M «k residue of the backbone; generating, 
backbone atoms for R CA, C; genera^^ 

Waals dieclc fer ^aeh atom; modeling disulfide bonds a^d reoordmg fee disulfide eoordieat^ 
pairs; adding zmmmsm to residues; porform^ der Waais eh0ofc for $aeb xotamer; 

performing foiiidii% test with a t^nplate protein; mid caioxdatmg solvent accessible swfece 
based energy for eaoh conformed a fbitber em.^diment : the protein nmy mmpme a 
peptide. 

[HOlSf Another MnbodhBenf o£ ffee present invention is a compiit^r-a^sisted mstbod 
for use in. ni^ditying a protem comprising foe steps of: hipislting a peptide secpene e into a 
eon#iiter^assisted modeling progOTB: i^ntting parameters for analysis into a 
computer-assisted modeling pmgxam: generating ^ W &&&& randomly in allowed region 
of Rainaobandran maps; assigning angles to each rssidiie of the backbone; generating 
feaekhone atoms M, CA, C: generating the rest of tbe feacKhone atonas; perlbiimng van der 
Waals check with all other atom after each atom is added; adding rot&mers to residues; 
ofeeeMng distance pairs between atoms; modeling disifiStte bonds and recoiling tte disulfide 
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coordinate pairs; perfofmmg ^an der Waals cheek ^■^ : -^^fide''lHto<fe :: :^^ the eornplete 
cmxibmm:; imm^g mm^>m: of confbrmers that are able to form disulfide bonds; ami 
calculating solvent aeeessibte s^rf&^e basad esei^ lor confemier, 

[0016] Yet anot&sr embodiment of the pmsmit mention is a eoinpnter^assisied 
method for use in modifying a protein eQmp^siEg the steps of: inputting a peptide sequence 
and parameters for analysis into a computer-assisted modeling program; searcMBg of 
ponfoi&a€OBal Jpaee in the allowed regioim of the Raaaadia^b^ 

and C teffisM of^fe as the Mgh resolution stmcture; eliecMog the 

haridedness of the eoi^bimer; aligning the eonfeHiKr to the high resoMion structure; and 
performing a -yah der Waals calcnLation. 

P®!?1 Stili another oiibodimeat of the pre&eiii Myentioii is a coio|?ufe-assxsted 
method f&r we of: inpatiiig residues B^mhei^ <sf 

the Sexxfeie loop of a protein into a c:o^ipel^assisted modeling px>gmni; Inptiitbg 
parameters &r analysis into a compiiter^assi^ted modeling pro gram; general | 5 to angles 
randomly in allowed region of Ramaehandran mapi; generating: ba^bos^ aiomk; p^lomiing 
a CA-CA. distance cheek for the N n&d G terniimj mteniiM 3Sf and G termini of the 
con&rmer to be the same as the high resoiHtiaii straetinre; efeeclciag the handedness of the 
ccsnfbtitien : ^erfonMng: vsa* Waate eheefe on haefehone atoms; aEgning 
the M^i resolution stmetere;. performing vaxi der /Waals cteefc on haekbone and template 
protein atoms; adding sMeeb&ins to the baefcho&e atoms; and perlbrming van der Waals 
cbeck oh all atoins. 

|90i8| A& embodiinent of the pms&ni mrenimtx is a method for determining the rate 
of disnlfide bond loop closnre m a proteK* eoi^prislng m least one fwp-oysteine motif 
represented by C-X^C where n is an integer, die method ccmiprising the steps of: perforrmBg. 
a ^ari der Waais caleiilatlon on a :mu!%B&ity of eoiifomiers of the peptide and subttactkig. 
those; eomfOH&era : that easiiiot . fotm an intxainole^ yield an ensemfele of N c . 

stetically allowed confermers; analysing the ensemble of sterieally allowed eonfecmers to 
yield an ensemble of J% con&nners that mn potentially form an inlxaxnolecular disulfide 
bond; ami calculating th^ ratio w^ 

in the peptide. In another embodiment the rate may be eonipared to the rale of dkislfide-boiK! 
loop closure of the peptide cor&ainmg at least one dilSrent two-cy stehie inoti£ In a further 
emhodioieiit, £he method may comprise the M®p of generating peptide backbone eomxiiriates 
Ibr the C-X^-C motif ftom, standard bond aoagfes, hond fengths and % m dihedral angles 
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ra&domly obtained witfein the ^3W^ : -*e^^ map fisr «aefe residue to 

yield ttte iMeltipliclty of : &o*tfdi^ In a li&feere^ 

fiir&er cor^rise tlie step of tisi^ to geai^ate CkXfrC side diam 

coordinates to jfeki tfee mMtipEclty of coBfemers of fee peptide Ip ^ aii^tfa^r emtedimant of 
the present teention, analysing si^o^ly aMowec! ^Momets. ::: M^ ecscolptise e&Ioniatiiag 
fee free energy of tile eanfeimers l>ase<i npon the solvent a&esssibie surface area. In yet 
another embodiment of the present im^entiOB, analysis oonfernim 
iBHy fuiiiiesr eonipxise flexibly^ ekams, In & flr^^ 

■tib© present invention, the method, may €om|>rl^ ^ by the 

diffimoee ib tree enei^ and diMISde^ of i&e G~XrC motif 

and calculating tie ratio 

sL; e ■ / JUi e 

m i mo 

wliicfa represents, ene£|^^elghted rale of disniide loop closure ixi fhe peptide, M 
a ferther eit^odment of the present iriveMo% tfie method may ooxnpdse fee step of 
identifying an eme-mble of eonfbrmers of the poiein iMt can potenMally £oxm an 
intmmoleeular <MMiMde feoad. In another embodiineM of the present indention* docking the 
ensemble of M c eonlbrmers to a hixidiiig site on a template feioiaotec^le to yieM &s ensemble 
of aligtied co^ and performing a yaxi der Waais cMeidatian on the ensemble of 

aligned eonfotmers to yield an ensemble of sterieally allowed confbmiem lhat bind to fee 
template bioinoleeule. In a fisrtber embodiment doc^ng tiie ensemble of '$3$ eonibimers to a 
binding site on a template Momol ectde tnay eom^rise of; aMgning the ensemble of 

H c confbmrers to a brndnig site on a template bion^lecule to yield an ensemble of aligned 
oonfbrmers; and perfonning a van der Waafe caleulatloB. on the ensemble of aligned 
oonfomiers to yield an ensemble of :3Sfc &ierieally allowed eonlbxmers tMt bm& to the template 
biomoleenfe. In another embodiment of the indention, the peptide may fetfeer comprise a 
plurality of two-eysteine motifs represented by G-X^-C herein n is inSep 
for each two-cysteine motl£ 

ifeotber embodiment of is a method for assessing the 

binding affinity of a protein to a toinplal^ moleonie, wbe^ln ^ af least 

one two-c^leine motif represented by where n is m Integer tbe method oomprisJng: 
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dotMng^ to a femdfeg sife on., a teBipiate: Momoie^ie to yield 

m mspmht® of Ht, mm&mmzs feat MM fee template biomolecule; and Galcufetog the ratio 
Nb/H* wbicb m mMmimo of tfe biidmg affini^ of fhe protoia for tlie tmi^Iate biomolecule. 

18829] Yet a&o&er e^odimeBt of fee ^seiit i&vesition is a method for assessing fee 
bmdfeg aflnity of a protein to a tmipl&te molecate, wberein fee px>tefe eoi^nses M least 
one two-eysteiimx^ m integer, fee metted co^iising 

the stops of: scremfeg a population of candidate peptides comprising at least- one two- 
ey^efee O-X^G wimm m. is m integer to yield a pMrality of candidate 

peptides Siat can potentially form m ioti^olecuiar disulfide bqM; aM per&rming fee 
method of fee ememfole of 3% eoiifomiers to a biMliig site on a tempiate 

teomolecule to yield an exisenfefe of N% mB&nmm feat bo&d the tepipiato bioiiioiecmie a^id 
catenating tte ratio JSj^Sfe which is mdicative of 1^ foindiiig aSimiy of the protein for &m 
template biomoieciile on at least one ca&<Maie peptide that oaa potenti&My fern m 
iniramoieciilar disulfide bond to assess fee feindmg affinity of tbe candidate peptide. In. 
aisofber embodiment of fee taventian each candidate peptide may comprise a pre-select@d 
amino acid seqiieixce. la yet another exnbodtaeiit of the iB^eotioii the pre-selectea atidno 
acid sequence may predispose the peptide to foim a desired secpadary stmctiire. fii aaofeer 
embodmient of the iwenlion desired secondary str^eture may te a p-tuoi. 

|0021] Another enihodimeBt of fee iwention ts a met&od for modifying a protein 
comprising fee steps of; evaJpating fee X-ray crystal structure or a iiaclear magnetic 
resonance solution structure co^prisi^g &a oxidised reference peptide bound to a target 
molecule, the reference peptide comprising at least one iptramoleeular disulfide bosidL, to 
identify at least' two mm<> acids at positions fs^dmbie. to mttamolecali^ disulfide bond 
tbrmation; spbstitutmg cysteines for the two amino, acids', iix -fee ref#0npe peptide to yield a 
modified peptide comprising at least four cysteines; identifying .an ensemble of N c 
eoB&miars of tbe modified peptide tbat eaii potentially foroi at least two intrat^olecnlar 
disulfide bonds; docfciiag fee ensembl© of M c eonfbimers to fee binding site on the template 
biomolecula to yield an a^Bembte of N& eoiiformers that bind fee template bimolecular; 
calculating tto ratio wbkh is IMi eative of the bmdixig affinity of the modified peptide 

for the template MoiBOlecuie; and lepealmg steps (i>(v«) to yield modified peptides baling 
cysteine substitutioxis at dife^e&i posltidtis so as to ideMify niodiSed peptides wife fee 
highest ratios. In another embodiment of fee invention identifyfeg an msenfele of N c 

conformers of fee modified peptide feat can |>oiaj!£ia% at least two mtrstnolecular 
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diMfc&de feoiids jsay oaMprise Hie slaps of: iteiM%Mg a first eonfernier of the peptide thai 
cm. potentially &im a first defining a first dfeoifide-bonded 

loop; constraining the ;model- : by : t3?e -^tst disidSde feond^^ confotmm 
of the peptide thai can potentially form a second MrmBofociilar disulfide bond defining a 
second longer disnifide-honded loop. Id another embodimmit of ilia iiiveMioB, if a second 
conlbrmsr is not idmitift^ 10 attempts to 4$m$& said eonformer, '#3$ 

method may finther comprise tlae steps of: eiimiBating tlie first disulfide bond from the 
model; i : dmt%Mg & fitsr ^ can potentially form a first 

i^mmlmmbM disulfide bond defining a diffeeet fixst dis^lfido-boiad^d loop; comtraiiikig 
the mode! fey ikm first disulfide bond; and IdmMfying a second cOBlbnner of the peptide tlxat 
can potentially fomi a second mtramolec^lar dMfids bond defining a second longer 
disalfide-bonded loop, 

[0022] Anotfe^r embodisBsat of the present indention is a ps^d for assessing the 
binding affinity of a protein to a^ tbe f rotaisi comprises a llexibfe 

loop* tike in^iod compromising the steps oft generatiag a peptide confermatioii of length N 
from h starting residue 1 and matsliing to a taxgat residue I H on fee peptide model; 
aeeeptiiig tbe loop oonibmiation wlien the deviation between residue H and fee target residue 
is small; closing the loop using a g^iH^tic i^irmBlmtiOEi method; selecting the residue 
conformation by the method of perlbrnriBg a vp der Wrts caieulatio^ on a mBliiplioity 
eonfbrmers of the peptide and si&tmcting tboso con&rmers that eamiol form an. 
intmmolocular dfeulfkie to yield an ensemble of M^mm^By allowed eoMbroiei^;.: Mialysiijg 
fee ensemble of steri to yield an ensombie of H c oon&rmers thai can 

potentially form : aii,;int^i2i9iecBl^ disulfide bond; and calculating the ratio MJN 0 which 
rapros0nts the i^e of disulfide bond loop closure in the peptide and generating peptide 
baoKboue cqotdiiiat^ fpi tha C-Xu-C motif horn standard -.bond angles, bond lengths and 4 
^ m dihedral angles rmdomly oM allowed regions of a. 4 * R^naehandran 

mm for omh mmdm. i*> ywM fhe irmliiplim^ an 
ensemble of sm-fase loops; and esfenaLting the binding aiSnity by testing the docMng of the 
full mini-protein ensemble and peptide t^ 

f0023i Another ^mfoodinieiil: of fee indention is a protein a computer- 

assisted method for use in rnodi^ing a protein eompiising the steps of: inpntting a peptide 
seqnenee and paramot^s Ibr analysis into a compute 

generating 4* W* m i generating a peptide hae^ det ^aals clieofc of the 
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backbone; calMilating a solvent accessible surface based energy of ail coBfemi^s^modeliMg 
the disulfide^); performing a van der Waals dieefc alter each roiamer is added; asid 
caloni&ting a salient accessible sin^aoe based energy of all eoniSrmers* 

P§241 An embodiment of the present invention is a protein produced by protein 
miniatini^MioH coB^tismg modeling a protein to hme die n^^ conformation 
using ih& method of inputting a peptide sequence and parameters for analysis into a 
eompnter-assisied modeling program; randomly generating % % m generating a peptide 
baefebone; perferming a van dm Waals efesefe of the backbone; ealenla&sg a solvent 
accessible siirface based energy of ^ performing a 

van der Waals ofeeck after eaek coiamer is added; and ealenla^^ 

based energy of all sonfto number of amino acids in the protein, 

P62S|; Another embbcimimi of the present invention is a protein eapable of docking 
into a binding site wherein by 
the iiiirodnetion of a disulfide bond by the method of inpntting a peptide se^pence and 
parameters for ^Balysis into a cornpnier^^ randomly generating &> 

\\t and d>; generating ^a peptide backbone; perfbrmmg a der Waals check of fee backbone; 
calculating a solvent accessible surface b^ed energy of all eteformers; modeling the 
dfeii!fid0(s); performing a van der Waals cheek after each rotamer is: added; : and. calculating a 
solvent accessible surface based energy o f ail confimners. 

An embodiment of Che present invention is a protein, created by the rnettod of 
inputting: a ' pepitle ^^tience ■ and parameters fer analysis into a eompnfe^assisted modeling 
program; randonily generating 4> W and generating a peptide baelcbone; perfonning a van 
der Waals cheek of the backbone; calettiaimg :a solvent accessible surface based, energy of all 
confbnners; modeling the diMMe(s); per&iinit^ a van der Waals cbeefe alter each rotamer 
iS added; and eaiealatij^ of all confbmers, baving 

tM cbaractexi^lie of ii&ibii4sig : the binding of a virus to a cell: wbereiri tlie preiein is based 
npoa a terdary stmctare of a toxin and comprises at least one loop constrained by a disulfide. 
In still another embodiment* and ensemble of intimnoleculax disnliide bond-ferming 
eon formers of said loop firom tbe protein m 

[09271 Still another embodnnont of the present invention is a protein having 
decreased oon&on ational entropy loss upon binding to a template molecule in comparison to 
the natnr&iiy occnrriiig protein due to the constraint of at least one loop of an unstable region 
of a protein in conformational^ and other ikmx disulfide 
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bonds fdmxd in the iiat^lly oeciirriiig protein using the method of inptittmg a peptide 
sequence and parameters for analysis into a.-eamputet^sisted in^^lteg; pt^gmn;- ratJi^omiy 
ge^eratmg ^ 5 ij/ aiid €0; gmeratmg a pe^^ eheek of 

the feacldsosie: calculating a sotamt aicoessilsk surface hased energy of all eontemer; 
modeKhg^ check ate each rotamer Is added ; and 

calculating a solvent accessible surface fe^ed ^^gy of all crniSmrntrs, M mm&mr 
embodiment, aii eBseHiM&of to 
of the protein may be produced fey fcis metbodi . 

£88281 As embodiment o^ is a pjrotem pn>cfeced by a computer- 

assisted method fer use is modifykig a peters co^rismg the steps of: inputttsig a peptide 
S€^ueaee m4 pai^ietem fer analysis; searet&ig eoii&mia&nal space m the allowed regions 
of the Rainach^di^n plots ; mMB^^g th^ N m& € rt same 
as the Mgh i!#s&!utiOti :: stm^tuire; e&eddisg i&e ha^^ cm^omm; aligning the 

conformer to the high resoluticm simct^re; arid perfommig & va& der WMs ealotiiaSon. 

immj Atiot&er mibvMmmt of the present im^entlon is a protein modified fey the 
method eotripiiskig tfee s^eps of; evaiiiafeg a X-rsy crystal structure or a imcfear magnetic 
resonanee solution stroetee eompti$feg an oxidised refeenoe peptide botmd to a target 
i&oleeuie, $w mf^mc^ p^ptiM mmpnsmg at least oxm intx^moleeular disulfide bcmcl, to 
identify at least two amino aeids at positions fa^omble to mtsm^oleeuia^ disiilficie bond 
fbimatiiDn; substituting eysteiBe;^ acids in the r^fersnce peptide to yield a 
modified peptide comprisiixg at least foiir eys*et&es5 iderffyhsg m mmmMp of N c 
eOTfemers of the modified peptide & form at least two iiitra^leoular 

disulfide bcrnds; doeMng t^ template 
blomoieeole to yield an msemMe of Mb coiifbmers thai Mxid the template Momoleeule; 
calcitlatlng the reaioa ISI^N* vMeh is indicative of the binding affinity of the modified 
peptide for the these steps to yield Biociified peptides 

having cysteiiie substitutions at different positions so as to identify modified peptides with 
the highest NtMt ratios, 

p§30! A fsirther aspect of t&e ixwentlon is a computer system that can iH^lermesil the 
described methods. The computer system has a soliware program coded to perform the 
described methods. Pi^ferahiy, a sotlware program would read protem se<|ueaee data from a 
database or from an input file. One embodiment of such a eoxr^uler system for desig&hig a 
modified-protein melndeE a database containing a set of protein sequence data and a software 
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program coupled with said : database . for mieractiosi with the database. The software program 
is adapted fer perfoBrdng fee ^ieps of g#*ei^ing : randomly ^litbmati^iia! asigtes from fee 
set of protein sequence data, generating a protein feacid* one mmg the con&HB&tiosial angl e% 
parformmg a van der Waais calculation of Hie proMii hacldxme* calculating a solvent 
accessible s^faee based mm$$ of conlbmiers, modeling disulfide bonds in. the protein 
ha&lcboBe, perfosBiiag a \*m der Waais calculation for the disirf&le bonds, calooiatmg a 
soi^mt accessible soirfece based <mergy of eonfemi^rs that are generated iii pinions stefs, 
and creatmg the ehara^tmstfes found m Ifee afeo^e steps. 

[0931 J Another embodiment of a computer system lor desig&hig a Modlfled-p^tam 
lias a database contaMmg a set of proteirtsegi*eme data and a sofl^am program coupled with 
said database. The software program is adapted for performing the stops of generating 
ftodomly conformational angles in allowed region of Eam&ch&n^ set of 

protein sequence data, g^iefatmg a protein baoKboije asing said con&iMatlonal angles, 
determinnig: disulfide bonds m the protein backbone, calculafeg linear oomfoimasrs, 
calculating :mlvmt accessible surface based energy of confbroiers thai are generated in 
previous steps, and creating the modified protem mng slxuete&l cbaract^risti cs identified in 
the above steps. The calcinatmg step or linked to an 

eKt^mal |m>gram for calculating coB&miers. 

A^otber aspect o f the medium: baling 

stored therein a. sofiware program tbat is capable of e^ecnting the metJiods described heseiii. 
The con^ter-teadable medium may be any storage-readable medium ntilsEed by a ccmpiiter, 
for pwposas: of i!iMtrati6n : but . not for limitation, may Include floppy disks, "hard drives, 
storage drives, disk packs. ROM, R^M ? PC cards, optica! media, and magnetic media. 

£©©33! Other objects, ioateres and advantages of the present ixreention will become 
•apparent from fee following detailed description. It shonld be nnderstood, however, that the 
detailed description and the specific emimpies* while mdicatiag pi^ferred embodiments of the 
im^eiiti0B ? are giver* by way of ilit^stmtioii oiily, smee various changes and ro^dificatioiis 
within the spirit m& scope of the mvention wi^^ those sMMed in the mi 

from tMs detailed description, 

BRIEF SIJMMAHY OF THE SMWMGS 
|0034| The fellowmg drawings fe arc iscMed 

to further demonstrate certain aspects of the present invention. The iiB^e^ better 



\mdersiood by rM&rense to one m tmmMihss&: djrawfeigs :: iB comMB&iioB with .the: detailed 
description of specific embecfeneiiis p^ 

100351 FIG. 1* Ixr&a tfc starting pasMoiis <>f Ifee 

three atoms. The letters p. cj and; T are ifoa veet0rs.: ^kpse le&gffis are tie standard bond 
lengths. MB, a ternporarj^^s^oa of X, is in tfee q direction. The distance belweei3 G and XO 
is same as the bond length, 

|00361 FIG. 2. Two M^oesslve 
polypeptide feaeM*om Rotation ^ is denoted by % and rotatioii atooM Ca- 

G bond by \{/ about 

|0037] FSS* 3A ? FICSv KC^ 3C astci FIG, 3D. ^ssigsBBeiit of f a&d ^ aiigles. 
The iess-fav^red regions, boiitided fey dashed M&es, are given 30% point pairs less fbaai the 
favored regions thai are bomided by solid lilies. The pictures are respectively FIG, 3A ? Mr 
ALA, FIG; 33, for VAL, FIG. 3C ibr C3TY S FIG. 3D !kfM3. 

|0O3S| Ff<3* -4- The. diagmtn'deMoastrates the two cysteiBes (i aM j) feat are assigjaed 
for disulfide modelfaig, Tbe positions of the stii&r SG are restricted on the circles that are 
obtained by tlxe rot^tioti along the Ca~CP hood. 

1 0039} FIG* S, The MPMOD ffew chart The input parameters (step 1 01), such as the 
peptide sequence and disulfide bomi cormeotivity, ate loaded, then the coafermatio^al angles 
o>) are generated in tto§ J&ur (step 102). Tlie: : #oms o f main ehaiii and side <Main 
.are generated base on tbe angles, T&e yan der W&als cheefe for the 

backbone atoms awi sido cbaln atoms (step 103). If there is a Van der Wa&ls violation, i ho 
ednformer wifi. be:x$jscte& It will go back to get ^^JfeoE set of confbrmayqiial angles until 
the peptide is fmish&d without any atom collisions. Then the coordinates of peptide am 
recorded apd the solvent aooessibie siurfeoe PAS) based energy is calculated (step i04). T&e 
disulfide bond is modeled to sea if tliere is a disulfide bond is possible for the two residue 
pairs (step 105). If a disulfide bond is possible, the SAS energy for tins confotmer is 
calculated. If a distdfi^ is tried 

ajid the procedure Is repeated until a eonfonner with a disulfide boiid is obtained. Finally^ tiie 
SAS energy is calculated for this coufe 106). 

[0040] FIG. && and FICS, 6B* Comparisoii of probabilities obtained ironi modeling 
with the equilibrmxn conMaxit obtamed from experiment (Zhaiig & Saiyder, 1989), The 
probabilities have been scaled to tbe experimental values, The scale factor fer CXC and 
GXXG was defeied by &HE&E^}^^^^ SCBxp) is the smn of all tbe expeiimetita!. 
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values and S(Mod) is the sum o£ &S modeled values. Eaofe mSividimi pr^b^biMSy of the 
modeled: coitfoimer is ttMtiplied l>y the scale feetor K to get th^ scaled value* The dark bars 
are ilc>m ^xpexim^M and tli^ ligM m FIG, 6 A shows tlie values for 

[0041] FIG, 7. Til© variation of :: ffee;mti6 K e /N<» as the iiimi&er of coBfermer feof^ases 
in tfea eiiseiBbfe Here, only the C3Q£C series is gi¥e&. The serfes is also s&nilar.. 

|0©42§ FIG* 8. Compariisoe of probablaties oblaiiiecl from modeling with tlie 
equilibrium c^is Mm mp&imm^ ^ 198^), Series 1 is the 

ratio of munfeer of hydrogen bonds hi the SS bond closed cordon by fee total 

&uiBfear of coiiteiiar. Seiies 2 is ^ 



Series 3 is fixmi tfae from expermiesxt S eries 4 Is the ratio M^N^ from the W sphere atom 
modes! In order to put them into ;0&e rpict^re,..^ scaled ;jEtU[ . the • series- in following factors; 
series 1 (nG00}> series 1 (*iQ000) y series 2 (^100), series 3 fSi}, series 4 (*IQ90), 

p043] FICS* 9* Iiiteractioiis for the p^tide-str^fevidln cotnplex.. Here the peptide 
kas two disulfide bonds that are cross-MBfced, The HPQ motif is sitting in the bliKling poefcsi 
and there ara threa hy dmgm bonds im^olvfcg in the interaction tor the complex, 

100441 FIG. 1©* DisMSde-hoaded random conformations for tlie enseals 
CCHBQCGMYEEC. Each con&rmer has two cms?4iaked dkidfMe hoiids. Tlie mmiooiiy 
generated conformed lias various coiiformaiiojis. 

PS45J FIG. II. Tlie iHiiiiber of chances for each residue of tlie peptide 
COiPQCGMVEEC to collide with the target streptavidin. 

[0046| FI€5. 12o CorreMi asi of t&e ^Hiding ratio" with the observed binding ecmstarxfc 
The straight line is iitled by mhiimizmg the somma tax Has ^S^i M (M3f - AG^)*, 

#0471 M€. 13A m& WIG. X3JBL How chart of the MPMOD program fFast Mod). 

190481 HG. %4A FIG, MM* Flow chart ofthe W 

FIG, I S, Flow chart of the I^MOfi progr^ (Loop Generation), 

[00501 FIG, 16, Mow eha^ of disulfide bends. 

PSSS J FIG. I7o Flow chart of fee MPMOD program's hmdfeg test, 
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DETAILED DESCRIPTION OF THE INVENTION 
|0052] The present application includes methods* of modifying , a peptide to increase 

the binding affinity of a template molecule by increasing the stability of a peptide by 

decreasing Hie conformational entropy loss upon binding to the template molecule. 

P0S3] A or an, as used herein in the s^ecigcatiOT, may raean one or more tban one. 
As used Mmn fe the elaini(s^ when m0& ,m -^i^^m^m:^^ tfee word ^eotnprismg^ the 
words "a" qr ! W may me^n one or more tfeni one. 

[CI6S41 AMffiw, m issed herein, mm msm at least a second or more. 

Based upoB a tmimry strugtmm, m nsed herein, refers to a structure that 
possesses a similar backbone stractere to feat of tfee. original structure that it is referred to' 
being based upon. 

|0O56| CcMfermer, as tised herein, refers to various. nQn-siip«rin^osatola three- 
dimensional . arrmigammts of Moxm that are :&itercbn¥^i€ble:.. mtM^t breaking covalent 
bonds. 

[0057$ C^strafe^cl, as nsed tedn, refers to a MtAn in ifee con&nnati<>nal space 
that the peptide niay adopt 

j[«3©5$|. PtsmllSde ferfdge and dls^lSsfe hernd m iised herein, refers to a cogent bond 
between the sid&r atoms of Wo cysteines. 

|00591 Ge&erMe, as used herein, refers the act of defining or oogiHatmg by .fiie use ©f 
one or more oper^oim l!M Mitddnals using the invention may create the matter or data 
themselves or loqato : tke matter or data elsewhere and utilize it In . the practice of the 
invention, 

|086©| Loop, as used herein, are tarns in the polypeptMe chain tliat reverse the 
direction of the polypeptide chain at the surface of &e rnolec^le. 

10061] Rotamen as used teein, rafters to a low energy amino mid side chain 

fQQ62J Peptide* as nsed herein, reifers to a ehaixi of amino acids with a defined 
secpenee whose physical properties are those expected from the sum of its amino acid 
residues and there is no fixed fcree-dimei^icmal structure, 

|O063] FrofeSsa, as used heiein, refers to a chain of ^mino acid resldiies usually of 
defeied sequesiee, Usjaj^ Tfee pclymerizati 

produces a pretein results in the loss of mm moleenle of w&erAom mch amino acid, proteins 
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are often said to be composed, of ami^o aeid t^Mi^v JftetEl proMii molecules may contain 
m many as 20 different types- oi mimp . acM residues, eadi o f which eoetams a dlsUiicti¥e 
si&e chain. A protein may he composed of imiliple paplides, 

[0064] Straeferal Cfearacterlsties, as used faerem, refers to the di^actetisliics that 
are determmed. mmg tfee computer-assisted program, such as, but not limited to folding 
clwabtemties,. disulf^ blading a&iity, aggregation, solubility, im^nogenicsty, 

stafelility, etc. Thus, on® of sldll M art realizes that tlie pmssBt mveBtion is used to 
deiemime any structural characteristic of a protem :snd this cbaraeteri stic may ha enhanced or 
reduced depending upon the application of use. 

Template molecule, as med lierein, refers to the protein to which the 
modified protein is hitidiiig^ 

II, MFMO0 

PCI6^i Gombliiatioii of the random search of the :cOBf03^iatioimI, space, in the allowed 
regions: of MamachaEdraii plp% using the simple hard spliere model to generate the stereo- 
chemically acceptable mnfom^m, and a flexMe disulfide bond niodeihig, poyides a simple 
ami useful too! to study the behavior of cyclic peptides. The "mW* or probability of SB bond 
loop closure as defined, by N C M 0 converges to a stahle value when the ertsemBle has more 
than 1000 conformed For the CJvC and C3QCC series of . peptide^, the modeled, probability of 
loop closure behaves the same way .as ffie ^xperihienmHy : detemiined equilibrium €onsiairtiC c 
lor ail. the torn types of the peptides. Both compare well aSer a cosmMii scale factor i$ 
applied. The geometry or van der Waals interaction plays a dominant role the loop closure 
for the small peptides €MC and €3&i€+ One of skill in the art realizes that the MPMOD 
method of protein design is not limited to protein pharmaceuticals, For example, it -meludes, 
but is not limited to, the use of the MPMOD method to desiga proteins that may be be&efieial 
as a diagnostic reagents, research reagents, pesticides or herbieides. 

10067] The program (MPMQ0) is an efficient method to generate disulfide-bonded 
coBfbrmers, It takes shout 10^0 GPU minutes to obtain 4000 disulfide bonded confermers 
CXXC using a Linux system on a Peuiinm^ffi 450 MHz;, Because the conimner CXG has 
higher probability of collision, it takes about 3 times more CPU time thai to generate the 
CXXG. However, the consnined CPO time strongly depended on the criteria used to generate 
the con former, 
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|®9©3;| The basic program oaMpises tke MIoMng steps. The mpxt 

paiutBeters 5 . such as the peptide sequence and disulii^ 

eoBfbrmMoiial angles (4 ? w, m) are generated in tbe four maps. Ttie atoms of mairi ..efeaiii and 
side cliain are generated base on the angles. The van cier W^ais cheeks are perfbmied 
separately fer tile backbone alosms ^ Waals violation, 

the CGsaSmm--yM^:^P^^ ft will go back gM apoffi^: set of «^ 
HMil fee peptide is fioishM wife^ 

recorded a&d the solvent accessible siirfeee : <S^) :: b^dxm©r^4s ■es$&t3ig&& Sh& --disulfide 
bond is modeled to see if tbere Is a distilfide bond iS:;&cjHS*afefo-. 3S*r: ■^b»,==*^i& jjre^Mtws pairs, if a 
disulfide bdiid is possibly the S£S mmgg for this eoBlorBier is qatalatsd. If a disulfide bond 
Is not possible* ano&er get df eoiifeti^tio&al angles is trisd and Sie probed^ is repeated 
until a eon&rmer with a disulfide bor^d is obtmt*e& Finally* the S AS energy is calculated for 
tiiis confonper again. 

MPMQI5 m be used to generate dis&If& cdniatmers and/or linsar 

conibrmem. To generate linear ooiifemers m coni^tion with dlsuMd^ bo^ed ecmfermers 

EEL Frok^ry^tlc Peptide display 

^ analysis of aiMurally obct^ng and artiBei&l protebi libraries bas 

been greatly iniproved by the devel^iiie^it of v&noiss ^splay** metiioddlogies. Tlie general 
sbbeme bebmd di^l&y tsebinqnes is fee adYa^ageoos egression: of peptides, and tfaeir 
disposiiipB: Ob sonie bioiogioal surface <pmg^ cell, efe). The ability of difSermt versions of 
tbe displaymg organism to preset milloBs and ixffi^ tbe rapid 

serening of the ccmt^spo^dibg Ibrary for. biological feietion. 

PH71| In UVS. Patent %$&1$4% mono^aiei?! phage display & descohed. This 
me&od provides, for the selection of novel protems;, and variants thereof Tbe metbod 
eoiBprises festng a gene eroding & p^ domain of fee 

gene HI eoat protein of the filamentous phage Ml 3, The fusion is mutated to form a library 
of strueturalty relied feslon proteins thai are expressed in low ep^tity on the siuti&ee of 
pliagemid candidaies. 

[08721 U.S. Patent 53?1 ? 69S describes directed an MI 3 phageniid 

system. A protein is expression as a fusion witb the Mi 3 gene ffl protein, S^Geessi^e ronnds 
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of Biutagenesfe are peribrmed* each time selects^ tor imprcweel biological function, e.g., 
feindmg of a prstem to a cognate bMing paxfeer, 

P873J Beterodimer phage libraries are described in Patent 5,759,817. 

Filamentous phage eoxoprisiBg a matrix of c^^II |)roteiBS me^su.ktikg : .>a genome eiiecdiMg 
first and ssgobcI polypeptides of m aiifogeeoml .y assembly receptor , spl as m aMibody, 
me provided, :The : reeepior Is stirfa^mteg^ate-ii into the phage coat matrix via fee epVIII 
me^hrme ^ assessment. 

P>§74] ABofeer system, laMbdoid phage, also can be used display purposes. Ib 
UvS. B&tent 5^672,024, lambdaid phage comprising a matrix of pTOtekis enoEpsolating a 
genome encoding first and second polypeptides of m antogeaously assembling receptor are 
prepared, Hie sBtfeee-iBtegraieci receptor is available on the gurfaee on file phage for 

JW75I Immimoglobulin hmvy oham libraries are displayed by phage m described m 
Patent: 5,824,520, A single efeam axiMhody library is generated by cresting bighly 
divergent, synfhetie hypermiabl^ mid selection. The 

resulting antibodies were used to ixibibit iiitracellBlat enzyMe activity* A&bther patent 
describing antibody display k tXS. Patent 5,922,545. 

[00761 Another exaxnpfc of phage display eaa be &mnd m UM, Patent .5*780,279. 
Tbis metbod provides fer the idenlification and selection of novel substmtas tor eii^ymes. 
The method comprises contracting a gene ixisioii eomprismg PNA encoding a polypeptide 
gised to a GNA encodmg a Mbstraie peptide, which m torn is f*i&ed to HKA encoding at least 
a portion of a phage coat protein. The; DMA eaeoding tbe substrate peptide H mMMM at one 
or iBore eodons, thereby iterating a femiiy of mtitants. The :fesion : protein is expressed on 
the snrfece of the phagemid particle aisd si^eoted chesTOC# 

the substrate peptide. Those pbagemid particles that have been modified are iheii separated 
feoiti those fba£ hav^ not 

[Q077] Bacteria also have been used siiecessihlly to display proteins, U.S. Patent 
5,348,8^7, describes mission of proteins oa bacterial surfaces. The compositions and 
methods provide stable, surface-expressed polypeptide from recombMant gram- negative 
bacterial cell hosts, A tripartite cMmeric pie and its related ..rec^mbiB^t vector . im^iude: 
separate BNA sequences for dir©ctmg or tai^efeig and traBsfocating a desired gene pxoduot 
firoiii a cell periplasBi to the eternal cell snrftee. A wide range of polypeptides may be 
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efficiently surface expressed usmg this system, See also, U,S, Plants 5,508,192 and 
5,866,344 

P07S| U.S. Pate&t 5,500,353 describes another bacterial display system. Bacteria 
(e,g„ C&idohMeim^ having & S4&y^ S-Iayer protein gene 

contains one or more ki^firame fiisicms codmg for one or more Iietsrofeg^s peptides or 
polypeptides is described. Hie proteins are expressM on the surface of the baeteri mn, wMeh 
may adOTHtsgeonsiy fee eulteed as a Sim. 

IV.. MMtageise^Is 

[007.9] Whore eixipfej^d, metagei^ by a t^aety of standard, 

motagexiic proeedsires, Mixt&tkm is the process whereby ctatges omm in fee cpantlty or 
structure of ats orgmnsm. Mnt&fioii can mvfciy© ^o<SlcMion oF Sie micleotide secpeace of a 
single gene, Mocks of genes or wife chromosome, Qimigcs m s the 
consequence of poi^ imitations t&^^^ Edition or sixbstitiition of a single 

rtiaci^iMe base within a DM sequence, or tfiey mm be the mm^mm^ of changes 
mvdlviiig the insertloii or deleti on of large numbers of nucleotid es . 

P0S0] &tatIoiis can. arise spmMmmsW n result of events met. m:mrnm: : m^ 
fidelity of IMA jrs^Bsa^ genetic elements l^xansposoHs) 

wiMii the genome. They also are feted following exposure to cto^nieM or physical 
mutagens* S^eti ^ ultraviolet light and a 

diverse array of cfeeiriical smh as a&ylairig agents ^ all 
of which are capable of mtetBCtfeg either directly or 

metabolic biotmns&mislioiis) with nucleic acids. The BMK lesions indiiced by such 
eoviroiinientai agents may lead to modifieaiions of ba$0,seqiieiioe, wh&ii . the aSeeted I>NA is 
replicated oivrefjaited <m& ihm to a mutation. Mutation also can fee site-directed through the 
use of particular targeting methods. 

[8081-]- Stmcture-guid^i site-specitlc mutagenesis represents a powerful tool ibr the 
dissection and engineering of ^ (Wells mat, 1996), The technique 

provides for fee preparation and testiog of sequence variants by ititrociHcing one or more 
nucleotide sespenea changes into a selected DMA. 

f©682jj Site-specific mutagenesis uses specific oligonucleotide sequences that encode 
the 'DMA sequence of the desired mutation, as well as a snfEcient ntmiber of adjacent, 
nmnodifed nucleotides; to this way, a pr&ier sequence lis provided with snfficient size m& 
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mmpkmty to fbmr a stable duplex oa both sidm of tfee deletion jmctioii being trnmrmd. A 
primer of ^>oiit 17 to 25 micieotides hi Iang& is preferred:, with about 5 to 10 residues on 
botti sides di%ia jmicMon of the secpence being altered. 

P$83| Tti£ te&mque typically employs a ba0ieriop^ 
#sgle~strasicied and doiifele-stmjiaed form. Vectors onefil m site^ii^Gted mutageiiesls 
Include vectors siicli as tfie M13 phage. l%ese and 
their use is generally well toown to tteose skilled M tile art. DoaMe-strEnded plasmMs are 
also routinely employed in site-directed ^luiagenesls,. which eliminates the step of 
trao^ferri sig the gene of interest ftsm a pbag e to a plasmict 

{0084] Izi general, one first obtains a single-stranded vector, or melts two sir&n&s o f a 
doBble-sfeaiided vector which Includes wiim its seepesice a KM A sequence encoding the 
desired protein or genetic element Ab oligoMicieotide primer bearing the desired mutated 
seqnene€ v syn&etlc^lly preparecl tbesi asmeaied wiSi tlie siogle-straiided ONA p^paratioo, 
taking into aocoimt fee degree of mismatch when selecting hybridisation conditions. The 
hybridised product, is suhfseied to DNA polymerizmg mzpms mtih as E. eoli polymerase I 
(KLlenow fragment) in order to p03$^$@t$ tii^ s^ithesis of tfae strand. TIihs, a 

faeteroduplex is formed, wherein one stmnd encodes the original;. BOB-iBUtate^ a^qn^nce^ m& 
the second strand bears fee desired mutation. iMs heterod&pie^ vector is then ased to 
transform appropriate host cells* m^ch m eoli eelfe, and clones are selected . th&i . include 
reeornfeinant veeters hearing ^traEgeBieiit 

[0085] Other mettods of .sita^dit-epted iiiiutageii^s are disclosed m ILS. Patents 
5,220,007; 5,284,760; 5,354,670; $,366378; 5389,514; 5,635,377; md.S,7$9M^ 
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% Modified PsIymM^ 

{0036] Amino : ^cM^&Ss^si§km- ate gmemily based m the relative siiniim^ of tlie 

si^e, md/m the like. An acM sM^chaki 

^festitn^its reveals that argn)i&% cbarged residues; 

that almiiBe, glycine md/m $mm are ail a simikr sim; m^m tteat p feenjMaoiae, tiypiophra 
and/or tyrosine all have ?a generally similar #ape. Tteefce, ted « tee 
considd-atioos. argmirie, lysine an&fet Mstldme; dam&e, glycine m&pr semxe; and/or 
pheiiylalaiimc, tryptophan mS/or iymmm: are defined komm as biologically Emotional 

£0®87| To effect p<p :p^ fivdrop^ of amino acids 

may be coBSidared. Bacl> am« acid has been assigned a Iiy dropafeie isi<ie& on the basis of 
their liydi^&ofeieity aad/or ofearge eharact^risties,, fxese are: isQi^aeitie (+4.5); vaime (+4-2); 
leucine (+3.S); pfejtfai^e (+2.8): ©ystefee/cystiiie (+2,5); mzMmmm (+1-$); alanine 
(44 ,S); glycine f0.4); teases J); serine tryptaplisa (-0.9); tyrosine (-1.3); 

proline (-1,0); MstKfe^ (-3.2); glutamaie ,(^3k% ghstoiBS (-3.5); aspartate (-3.5); aspaiagmB 
(-3,5); lysine (-3.9); aad/or argmfe^ C- 4 ^). 

[90881 The Importance of the bydropafeie axmrm acM index in eM&tdHg mterkcti-ve. 
Moiogkal fiction oti aj^oieia is ge^emlly: imderst^od m the art (Kyte & Doolittle, 1982, 
incorporated het^ by reference), B ktiown p aeids may be siifesti-teted for 

ether amino acids having a 

biological a^ti^ity > In making ehaixges based xipon -flte- hydropafeic iiKfex* fee ^stkuti of 
aniiiK? acids whose bydropalM^ tedic^s a^ i^tbfe^i^ Referred, those which are witMn ^1 
are patti cuiarly preferred, mz&Jm those within +Q>5 are even more partieularly prefered. 

|00S9] It .also is isiidexstood; m .tlie art that file substitution of like amino acids can b# 
made ef&etiwly on the basis of bydmpMlieity, partiouiarly wh^re tfrs biological fe&eliosal 
equwalent protein a^id/or peptide thereby created is iiitasided tor use in iaii^uiiologfcal 
embodiments, m in eertam ^bodHBeots of the present iiweatio^ U.S. Patent 4,554401, 
mcoiporated: . by mferenee, states that the greatest local average hydrapfaiiieity of a 

protein, as governed by the hydrophilicity of its adf acent amino adds, correlates with its 
ismaimog^fe^ a biological property of tlie ptotehi. 

£60901 As detailed in IJM. Bateat 4 ? 5S4,IGi 5 ^ valBes liave 

been ^sigtied to mmm acid residues: arginine (+3,03& lysine (+3*0);. aspartate /(+3:.0 * l)i 
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ghitam&ts |H3# 1)- serine (+0 .31; asparagme fKX2); gteamine (-H},2); glycine (0): 
ikteordoe (-0,4); proliae (-§.5 =b 1 ); alanine (-0.5); Mstidine (-0,5); cysteine (- 1 ,0j; metkkmiiie 
(-1*3)5 yatee C-1.5)j letiein^ isole^cine i~lMfc tyrosine. (-23); plienylalaiime (-2.5^ 

tTj^tophan. (-3,4). In making changes based upon similar hydropMliclty values, the 
siibstitotion of atm&o acids ^^sa liydropM are wMiin ^2 are preferred, those 

that are within dkt are pa^mlady p^fered, a&d/or those within ±Q3 are even inore 
particularly preferred* 

|ll©9i| The present inventors 0oiitetd|>late that -structurally stailar compounds moy be 
fomtniated to mimic tlie key portions of peptide or polypeptides. Such eoxnpo^nxis may be 
termed peplidomiffietfcs. 

Cartaia mmiMcs tfeat mimic elements of p roiem secondary Md tertiary 
strucfere are described in lotoscm ei: al (1993}. The imderlyiag r^tioHal® hdniid the use of 
p^tide niiiBeties is that tie peptide backbone of proteins exists chiefly to orient amino acid 
side chains in such a way as to J^ilitafe such m those of antibody 

aiid/or aixiigeiL A peptide mimetic is tbus designed to penrdt molectdar interactions similar 
to tbenatiira! molecxrfa 

$96931 Some successful applications of tbo peptMe miinetio concept tee focused on 
inimeties of p-toms witSii?: proteins, whicb are known to be highly antigemc, As discussed 
herein* possible p-iurn sfraetee within a polypeptide can fee predicted by computer-based 
•algo:tit!lms. 0bo£ the component amino acids of the tisrn are detennined, mimetics can be 
constructed to achieve a similar Spatial oiicsitatioii of tie essentia! elements of the amino acid 
side ohaiixs- 

|0OM] Other approaches ha^e focused on the lise of small, mnlttdisi^^ 
proteins as attmotive structural' templates for producing blologioaliy active, cotifottxiatioiis tbat 
mimie: the binding; sites of large proteins. YiMetal (i99B% A stmettsM: motif thai iap|)ears 
to be evoMiOBarily conserved in certain toxins is a small (30-40 amino aeids)^ stable,, and 
highly permissive for mntation moti£ This xnotif is composed of a beta sheet and an alpha 
helix bridged in the interior €ore by three disulfides. 

pSM>Sl Beta EE tiims have been mimicked successfhlly using cyclic L-pentapeptides 
and tbose with D-amiiK> acids. Weissboff et al (1999), Also, Job^nnesson et aL (1999) 
report on bicyelie trrpeptides with reverse Umi indncing properties. 

m 



BMSDCCfC>: <WD._ 



wmvmmmmm 



11)096] Methods lor generatm^ disclosed lii tlie art Box 

example, alpha 4ielix mimetics are disclosed kM Batests 5,44<5,!28; 5,710,245; 5^840,833; 
and 5,8594 §4 Theses structures rehdpr jj^e§^ldUdbssi = -4oaer E: ^nrip^%»in- mom ihmmMf ^ stable, also 
increase, resistance to proteolytic degradation, Six ? seven, eleven, twelve, tMrfeesi and 
iom^eea membered ring structures are disclosed. 

|0097| Methods iM gepemth^ and feata bulges 

are described, for example, M Pataiiis 5,440,013; .5*61 8,91 4; ami 5,670,155. Beta-turns 
permit changed side sufestit&e&ts witioat having changes m coiTesponding b&cfcboBe 
confem^atio^ and have aj^rofsriate termini Jpr ineor|>oratIoB into peptides by stand^d 
synthesis procedures. Other types of mimetic fens include reverse and gaiBma terns. 
Reverse turn in ..UJgL Patents 5.475,085 arid 5*929*237* and gamma 

turn mknetks are described in U.S. Patents 5*672*68:1 atxi 5,674,976, 

The MIo»g ex^opksi are iB0lMe<l • t& demonstrate preferred embodiments 

of the iiwmitioii. It kheuld foe appreciated fey those sfciiled m the mi that tbs teetaiques 

disclosed m the examples which fellow represait tedfeiiques discovered fey the inventor to 

fenetion well in &e praetiee of the xhwiiMoi^ and ttes can be considered to constitute 

prewired modes ibr fe practice. Howwer> those of skill in the art sfaonld, m light of the 

present disclosure, appreciate 1&at mmy changes can be made m the specific embodinients 

which are disclosed arid stall o&taki a like or similar result wiiioiit departing fi^om the 

concept, spirit arid scope of the mention. More specifically, it will fee apparent that certain 

&gents : :-ttmt -are "both' ^hemic^liy a^d-physiologioaily - related may be siibsdteted for the agents 

described herein while the same or similar results would be achieved. All snob simitar 

st#stitntes and modifications ^pai^ to tb the art are deemed to be tvithin the 

spirit, scope and concept of ife the appended claiins. 

Example I 
CosistrMciioB: of a polypeptide 

A unit peptide was generated using a rofetiomi matrix |M]5t^* 0eSreys & 
Jeffrey^ 1950J lor the effect of a rotation on the coordinates of a p^nt^^^^ an a&gle 0 ahcmt an 
axis ttmmgh the origin ha^iiig the direction cosines X, v. 
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F cos0-* -cob 6?) $#£1 cos#)- -#sja.#' Mv(t-om&) + p$m& 
j lv0-cos^)-^sin^ ..&^-tm:itfy+&£&& m^&^^i-mB0) 

[9100] Givmg the coordinates of three successive atoms A, B: and C* the ihvm 
oomponrnM % P ^ v in th^ maMr The fourth iom X can be iterated based 

on this matnx mid the dihedral a&g!e Qm bmd angle aCB~C~X) and the bosid 

kngfe d(C-X). JMi gives aifi%i^ 

[0101] For tfee coordinate used in FIG. .X 'tibe positiQa motors of the feee atoms A, 0, 
C are rl 5 r2, r$L The vectors p 3 q and V are x^specti^siy p-t2~rl, Q-r3-r2, 
where d is die OX bond length, The nmt vector m^pxq)/i;gx^| is normal to the platie 
formed fay atom A* B and XQ is the temporary position in tiie q direction, The position of 
atom is first rotated to tfe© X position with the rotational ass it arid tot&iiosial angle 
where a is -fee bond snglo of atom B s C and X. The final position of X is obtained by a. 
rotation with the axis q and the dihedral angle % (one of the Sir ee dihedral angles ^ \j/ and m% 
Both the rotation fi>r vt-a and % are olocicwlsc looking down the relevant vectors. Hie angles 
are positive if they are clockwise rotation and negative if anti-cloekwise rotation. The final 
positiori of fee atom X is expressed as 

where the iloiktios? JMf a has hem used for the xnMrix [M|^y of Eq. (I ) with a as a 
imii vector in thedirectidB of % p^v. 

[0106] StartMg from a imii peptide, all csf other atoms of a pol^epthfe can be 

detennined by Eq. (2). The backbone atoms (N s Ga, N, O, HM, HCa) were gmer&ted 

BlaMhg me of the eoiifermatiomal angles . m) and the : staMard hoiid : : mgles md hond 

lengths (Momany ei aL t 1975) are listed hi Table 1, The haolcbo^e atoms qb residue i are 

bisHt fey feeibllowiag paramat^ atoM Hi by {N^i ? Ca^ , Gj-i> ^i>-O^OotU-r€^Lir 

'3^Vii^t^>5 Caj by {Qa^x , v .Hs , ? a(C^^-€aj), Q by |Ga , Hi 

Cw * Wi 9 aCM^Cai-q), d(Gar <%)■}; CN hy |%i , H* » %>%. y a(^M>^ d(Ci-O0} and 

HNj hy $Cqkt* , €Vu Hi * a(Ca^-Q.i^#^ ^^rB0>> Wii^ cx k the bond a&gJe fotmed fey the 

three atoms m the parenthesis and d is Urn bond length of two atoms. Oj3 aim KGp are treated 

specially heeaiise of the telrahedral geometry With N and C atoms. Both atoms do Bot depend 
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on the dihedral angles (0, \\* y m) r Of the two pos$*l$e .^^§o!^;©lf the Cp atom, Oi.o one 
correspaixciing to the L-arnino aoid residues lias been tise4 : tM-oiig^Mt Use studies. Beiice ? Gpi 
atom of residue i is buili by {K;> Cat, Q, 109 J% - C pi )| arrf HGoCi by {Mi , Ca* Q 5 
- 109:5% d(Caj - MCa*)} FIG. 2 sho ws the diagram for two mm essiye nnit peptides. 

|01 #31 The side clialn atams of a polypeptide were built in the same way as shawa for 
hrriMing the h&elfene atoms, Tlie bond lengths aBd band angles : «e taken from the 
ptiblishM values (Mom&ny ei al 1915% There are ma^teally & of dihedral angles 

(Xl , x-* %4) fer the 20 amino acids. Surveys ofe^t^og^ of proteins 

and small peptides show that M x ^l^ afe highly 1978, Benedetti ef 

al, 1983). Ponder and Richards (1987) stu^ X a&g^s using 2273 

residues obtained from 1 9 protehi crystal structures with a resoMtlop higher than 1.8 A and R- 
factor below 0.18. Peader and Mcliards &rnn& that the popular 

higher in smtm ■ preferred yalti^s. m& tet :thfe steii^rd deviation was smaller than previously 
published values. This indicates that a rotamer library can appro^dmately represent the 
behavior of a side chain. The mtaxner library by Fonder and Richards (198?) was hed for all 
the diliedral angles. Ik order to reduce bias to as Htfle as popifele ibr addition of the rotamers. 
ilie rotatners: were selected that did not collide with the faae&one atoms for each residue. 
One set of rotanrers was picked randomly from ail the residues. However* this set of mtaniers 
niay have a van der. Waafe contaot violation* Three random tries were given to increase the 
ch ance of getting a ^uitaMe- Set roomers* If isb&e of tlie three tries -satisMess the \m desr Waafe 
cheeky the backbone is rejeeted and the proeedwe is repeated 

Example! 

Determi&atios* of ffie dihedral angles ^) 

101091 The subroutine rati2,£ Km ^Hnmerical Reeipes ?? (Press et ai, 1986) was used 
to generate random imnibers that tuiifo^Iy distribttte in the range Iknn 0.0 to 1,0, These 
random values have bo sequential correlation and the period is practically infinite. The 
conformational angies f ^ \\f) are assigned to be ax + b where x is the random valne 0<x<LQ* 
a and b are two constants which are adjusted so that the two angles we restricted to the 
allowed regions of the Eamsehandran plots . 
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TABLE 1 



B oiid aiigle (°) 






1.15.0 ! 




1325 j 

i 




121,0 




1.453 | 

1 




109±5 


C r C<af.{ 


L530 | 




120.5 


Q-0= 


1.230 




189.5 


Car-CP; 


1.530 




109.S 




1.020 


[ Cu-^iM 


1.21.0 





mm 



Table I *■■ The bond lengths ari bond angles nsed for hniMmg polypeptide* The ban*! imgle C~ 
Ga-H is.'iiiot as rigid as other hond angles. Therefore it is allowed to vary ;x5° aroinid tbe 
average value 109°; 

* If residiie i is Fro ih^ bond Imgtb of Cw^Ni is I SSSA* 

[01 OS] In order to. sample the con&niiational space efficiently, tha-feadkfeone dihedral 
angles tji) of a .f>t6tein are : ^mded; i&io four categories, om lor gl^m^.one for proline, 
one for the CB-brancli^ amino acids (VAL, ILE and THE), aM qbc for all other : amino 
acids. Glycine, with one hy#ogen atom as its side chain, adopt a wide rsage of 
conibiiH&iiom and the nuap is sj^mnetrieal due to absence of m R substitute on the alpha 
carbon (Got),. Proline only adopts a very narrow raage of coafbrmatioii space because of the 
pyrrolidine rmg attached to di^ B md :Cm MaiM ?; w^ coo&imatioii greatly. 

Alanine is a prototype L-annno acid whose coiybmationa! space « approximately 
represent that of other ammo acids except fer pmiiiie and glycine. However, dee to the two 
branches cm the CB atoms, the mnmo acids (VAL, ILE and THE) have mme restricted 
coBfomialioiiai space than ALA (Schemga, 1992; Chakrabarti & Pal, 1998), FIG. 3 shows 
distribution of the coiiibrmatiorial angles ^ : ^\f6tt'-m$$& As the numher of the raadoni 
values becomes sufficiently large,, the points will evenly dlstrifeiite in the allowed r^gioBS. 

MacAxthw Thorntan (1 993} made m analysis of the eoiifbraiatioiiat angles 
for proline residues from BOn-homologo^ X-ray crystal strrictees deiennin&d to Z5A or 
better; They showed that majority of the confomiation angles are olostered about the mean 
values of ^ ^ -61% -3S^ for the a region and f, -65% 150^ for the p region. The early 
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mmputsuoxm (a^,^amacfeaiidraH et al, 1963; Metnethy and ScMeraga^ 1965) with hard- 
sphere potential* a good first order approximatiOB^ stewed that the region airoiiBd the *p* 0* 
was not allowed. When reaifeflo p^tentia! was psed ? this -region feecBnie partially allowed. 
Based oil this eorisMeraiion, #e the faired i^gi^ ^Moli are 

bosmded fey solid 1mm and 1 ms-fmrnM ragioBs which are bon&ded by dashad lines. It was 
detennined to give tfae 60-8094 of a ofeance o^^^ 

ferored regies. For all M^ aeife and pr^liH^ ,imMy" : &%i''ofw^^''^^^ 

were assigned to the i> positive region Thia^ is similar to the one 

proposed by S<wdh^iiM et al. wife ii^ aspects; 1) the 

investors sampled £<^&iM;^^ to fee closer to rhe Eaniac^andrsii plots and two 

small areas were added sod 2} the ■fm0^---M^^0^^'-^^ map is 

tf %OH-^ey^a- These aspects speed up the niodeHng of the peptide significantly, 

[6167] The trass aiid <5is l^^ &nd 0°. 

Tfec non~Pro amino acids are xkrored m trans ferm by a ratio of ^pjsroxknafely 1 000: 1 * Wife 
proline, the trans form is m£y 'laired % 4: L lfeerelctee : tioji-^o amino aeids w^e set as 

100% tram fomi aiid the prol&fe was givm iip fe 5% (or optional) cis form and 95% term 
form. The dihedral angle o> w. also allowed to have a fiuomatiom about 5° around the value 

180 or 0°. 

Example 3 
y&m der W&ats sf eric gc^tacfe 

[01 Ml The hard sphere Mom model was assumed as the scoring fenctiosr to eliminate 
grossly improbable eQiifdrmprs, Ra^iachapdraii (1963) BSed X^ay data to detennine a list of 
contact distances for each fcted of atom pm (see &e %ormai ?y and "extreme* 5 distances of 
Table 2>) occurring in proteins. These distances are about 0.3 ^ CIS A smaller titan the 
snirmiation of the wars der Waals radii of two atoms CG^e^zotti, 19§3)« Ga^ezzoti cooolBded 
tha^ file structure was less stable for tke distance of -^^s^.&&t-^-^m fm the distance of 
formal limit" Bk^^er^^ extreme Irnii 

are hydrogen bonds or otber attract^ oalibrated the van der Waals 

radii of atoms using an inverse RaniaehaBdra^ plot> The calibrations were based on the 
comparison of the Eitoaoliaadraa plots Stained ftjm Mgfe 'f^1^^-%^. data of proteins 
and peptides wr& the allowed conlbimational space for ths cli-pei5tMe rxioleoaiar models fenilt 
from the pufolisbed standard bond angles and bond lengths. The calibrated contact distances 
for each atom pair are about M to shorter than the ^ttexse Itoif • (Table 2% 
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pli9| Qiit of lie teoe kWs the sliort contact distress, tafe fee ^m&mm limit* 
and the "calibmfed" distance warn used for fee m to Waals GM^cfc, imlass ofeemdse 
specliiecL Tte "extranie" distances were ased only Bar the baclcbcms atom pairs e&eelced. "The 
^oalferate^ chain to side chain, or side chain to bacifco» 

atOBi pair clieck, Tbe xmmm to use Wm coiitaet distances were t!i^ MIowiBg: (I) to give 
some and sligMly mors |l0xiM^ the side chains, (2) to 

e<5Hip3!*s^ by fee fi&e^^^^ used to 

feoild t&e polypeptides, (3) to loeiud^ some attractive features ia fee 

coxiibrmer. (4) to save coffip^tiiig tixns especially whim the side efeaiB ato^ are lim>iyed 
m van der Waals elsecks. It slioiiM be ii^te^ all fee possible HOi^onded 

atom pairs are eheckedv A*m& pairs In fe^ same residue are not cheeked. 









Calibrated (A) j 


H...H 1 


2.0 


1.9 


1.9 : ] 




2.4 


2.2 




0...N ! 


2.4 


2.2 


2.3 | 


H...C 


2.4 


2.2 r 23 

I i 


O...O ! 


/ / 

' 


2.6 


2.4 


O...N 


£.1 


2.6 


2.47 


O...C 


r • -TTe 


2.7 


2.55 


N...N 


2.7 


2.6 


2,54 


N...C 


2.9 


2.S 


2.62 


C...C 


3.0 


2.9 


2.7 


H...S 


2.5 


2.3 


2.3 


0...S 


2.9 


2.8 


' 9 7 


N...S 


I 2.95 


i 2.S5 


2.77 


G...S 


3.05 


2.95 
1 __ — 


2.S5 


S...S 


! 3.1 


i 3.0 


3.0 



Table 2 The short contact distances between each kind of atom pair. The norma! and 
extreree distance is Jx©^ (1963). The caliMated #staaeeis fromlijima 

ei al, (1987), 
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Example 4 
Modeling disMlflctfc tseods 

PIOS] The present indention includes a .flexible method to search for the potentially 
emoting disulfide bonds in a stmctuxe. For to specified cysteiBe % the coordmatas of the 
sisMiit atom S' ate generated and the aooniinales of 

ateois ^ C% The positions of a&stss ffittst be on a circle which is formed by 

rotating about the Ga^Cff bond with the rotatioiml angle %\ Statistics shows tfeal the favored 
df&edrat angles %' 1 ^ arooM -60, 60 and IS©**, In the present iaveadoi:^ a wider region 
arormd ^acii angle was scaled ile^^M&mi -20 to -400, 20 to 100 and 140 to 220. The 
coordinates of S 1 are recorded e^esy four degrees when ^tatiiig afeotit bond. The 

same procedure is applied to the specified residne j > The distances ^ for all the 
ge&entted atom pairs S* and S j ofttkfth -circle -FIC5-4 shows one of the generated snlfhr pairs. 
If the distance behveen W and S j is witlife ZMMOAk and :: ibe bond mgfe: : :G^^S i ^<l- C^S^ 
S l within iG4±5* and the diliedtal . atigl^. x^CCP^S^C^) within i90|±40 d 5 it was assumed 
that these two cysteines cobM form a disulfide bond. To ensure that tile generated S Mom 
positions me in good, geometry with all . a^9c?3C5r»s-- c*^" 3 ^^©^ eon^rmer, a van der Waals check 
was also performed This reacts many position pairs, espeei&Iiy when Co 5 -Cp 1 bond of the 
first €ys is approximately in line with e^-Cp^ of the second Gys residue. Hie best position 
pair wag then selected, 

10111] Te test the pmeedpre for predicting the disulfide bond, 19 disulfide bonds 
were examined tfaat we^e bo t saeeessfeSy modeled by Sowdbamiiii ef Ml, (1 Table 3 
Hsis the data irdm the modeled and ^sMio^apiiioalIy> observed: .diisiaMde: bonds.. All the 
disnlSde- bonds; were suoeessftdly predicted (Table 3>: Moweveft pro disulfide bonds liave to 
be modeled by adpsti^ angle je^-ito be beyond f90j±4O* , 
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Protein 
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Gp 3 -S j -S r 


Azunn 


1.9h 


-i>2 


-65 


-3 7 




- / 2 


103 
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C2AZA) 3B-26B : 


1,95 


-87 


-66 


-60 


-66 


-70 


101 


102 


Carbox>'peptidase 


1.89 


93 


-79 


-74 


144 


-57 


103 


1.05 


(5€FA) 138-161 


1.93 


92 


-84 


-78 


138 
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99 


Lysozyme (MEW) 


2.1 1 


80 




-178 


148 


42 


105 


104 


(6L Y2) 76-94 


2.32 


70 


-28 


-ISO 


153 


44 


101 


107 


Lysozyme (Human) 


2.08 


95 


62.6 


-71 


81 


-58 


103 


107 


(1LZ1) 6S-S1 
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94 


62 


-66 


82 


-58 


104 


102 


;0vomycoicL third 


1.99 


99 


-179 


-70 


65 


-47 


105 


104 


(20VO) 24-56 


1.99 


105 


477 


-66 


64 


-53 


104 


103 


Rat mast cell 
protease 
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-88 
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-85 
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GlxitEliiioiie rechictase 
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1.98 
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-78 


-139 


-82 


105 
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(1TPP) 42-58 


2.16 


-72 


-108 


-70 


-144 


-81 


103 


108 ; 
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b-Trypsm complex \ 
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2.03 
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-70 
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100 


104 


(2SGA) 42-58 


1.90 


-93 


-76 


-58 


-1:45 


-107 


107 
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2.05 


101 


68 


-76 




-55 


104 


105 




2.16 


93 


63 


-82 


96 


-44 
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Wheat germ 


i 2.00 


90 


-62 


-59 


144 
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(3WGA) 17A-31A 


1.95 


103 


-64 


-55 


136 


-66 


104 


100 



Table 3 , T^e.:fii^t,calTOBi gi^es fee protein nmms, the foar letters code snd the 
residue pairs for the foll0wmg; d(3-SJ 

fee distance between tfcq(tWo stilfem, %S-S the torsioii atigle Gp^S^C^ % ?: J and x H 
torsion angles N~Ca-Gp~S for residue.!' a&dj- %■ ~ 2 ^d for 
residue i axsd j - € p^S^S 1 aad G^-S^S' the bond atigle^ In each row the first line is the 
crystalIogr^>lHca% observed data and the seeo&d line is the modeled the data. Hate: a; In 
ibis case, v&ttati^ is set by |90| ± 70°, 

The program 

[01121 Tfeept^^ 

to perform a Monte Carlo search of edBfematlooai space. The idea for sampling 
Kama^liandrari m^s was based om the program (SowdliaiiBm at, 1S>93). Hie 

inventors used fonder M McbaM's TOtaHfer library (1989) along with a mibroBfina to 
generate side cJiaHis. Part ffi^ standard Formmi-J7 imi the part of 
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the program that calculates fee solvent accessible smime (SAB) energy and does the 
tiiennodyriamic analysis was witeii ill C ^Hyser & Eraire, 1996); FIG, 5 gives a Sow o&art 
of fee |^g?am that is nsed to model . <$isulgfe bonds. In step 100* sequence, disulfide bond 
mmediimi^ m ate- inputted The starting data is inputted tjiammlly or it 

may fee retrieved fem a database that is well known and used by those of skill in the art 
FMm the m$Mt data, difeedrai angles % o>) are raiidoMly g^etaied in step 101. The 
generated dihedral angles are nsed to generate a pMypeptide in. step 102, Once fee 
polypeptide is generated, a Van der Waab cheek is performed in step 103. If the van der 
Wsals check is acceptable, then tlie SAS-hased energy of the polypeptide is caiceiated iti step 
104. If the van der Waais check is not acceptable,, then dihedral angles are regenerated. 
Hexfc> step 105 searches for existing disulfide bonds in the g^erated polypepiide. After the 
disulfide baiids ha^e been modeled^ a van der W&als check Is performed to ensure that the 
sElfer atom (S) is in good geometry with all the other atoms. The best position, pairs are 
chosen and the SAS4?ased energy is calculated in step iQ&< 

fOlOi] The MPMOP program Is designed to generate disulfide handed confbnners or 
generate disulfide bonded conferiners and linear confermers. If the program is mn only to 
generate disulfide bonded eonforniers, thari it is considered • -the. fast moef % which is 
ifetrated in FIG. I3A .and.FlG.: 13B- In step 200, sequence* disulfide bond ccmieetiviiy, and 
other parameters are inputted. The starling data is inputted mannally or it may he retrieved 
torn a database that is well known auci used by those of slell in the art. From the input data* 
dihedral angles (cj> ? xjr m) are randomly genemted 1b step 201 and angles are assigned to each 
residue of fee backbone. Hie generated dihedral angles are used to generate a backbone 
atoms* starting front three given atoms in step 202. Tbe distance pairs are checked in step 
203, It is fenpoitant to determine the distance between the. two Ca atotds and. the dM^ee 
between the two Cp atoms. If tie distance is not aeeeptable > then the dihedral angles are 
regenerated. The distance between the cysteines (C) plays a role in the rate of loop closure. 
If the dis^anee is acceptable, then a Vm ster Waals check is perfemied in step 204, If the van 
der Waals check is acceptable* tk^ i&e^tesf of the feackfeone is generated, in step 205, If the 
van der Waals check is not acceptable, then dihedral angles are regenerated. While 
generating the baefeone^ if the van der waate cheek letnaiBS acceptable, then modeling of the 
disulfide bonds is performed in step 206> If the van der waais check does not reinaln 
acceptafele^ Jfc^i dihedral angles ate regenerated. Ne&t, raiamers or side chains are added to 
the h&ekhane in step 207. Rotasners are : ad<bd to e&chxesMne exempt for tfee: cysteines. From 
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step 2 0 % one can collate ail the iioiie~ v&h der wmIs violatio&s m step 208 and regenerated 
dihedral angles and in step 2Q9 &e fe^ope and all rotamer sot^SaatKHis are wiita^ 1^ a 
file. If the van der Waals check is acceptable for ea€;h rotain^r m step 210, then disulfide 
hosMed pairs are Reefed to eris^ (S) is in good g^om^ry with all the 

other atoms in step 21 L If all the checks are acceptably thert 

i&foiTm^ a ille &\ step 212. Next, a ^Misig test is p^feriiieci in step 213 for 

each eonfeH^ 

Finally the S AS~base4 energy i$ 

[01 091 As menlitJiiBd the MPMO© program also can generate disislfide bonded 
eoi^rmers and Imoar m&Bmmsc$< TMs "the slow mo<f \ 

which is ilhistrai:ed iB FIG. 14 A and FK1 14Bv In step 300, mqmmm^ MmBM& bond 
collectivity* arid otfoer par^eiers are iBp^tted, llse or it 

may he retdeved from a database feat is by ths^e of skill in the art. 

From ^ are r^dor&ly ge^emlM m step 361 and mgles 

are assigned to each residue of the hacifcoBe. The aggies are used to 

generate & h&cl&o&e aiohis, starting from thi^e gi^ea atoms in step 302. Keoct* -/the rest the 
hackhoBe is generated hi step 303 : W the : fm der Waals ehecfc Is aoeeptable, rolamets or side 
chains are added to the ba^hoBO iii ^ Rotam^rs are added to each resi<fee^ After the 

Toilers are addsd^ the diMmK?e pairs are cheeked, modeli^^^ bonds arkd van 

der Waais check for the SS pahs with the complete eoiifemier iii step 305 , If any step to 305 
is unacceptable, the imrtib® & SS ho&d is recorded aad the 

program is lii^d to tte energy AS for each 

confbrmer in step 308, If all steps m step 305 are aceeptafefe then the -number of the 
oo:^ is calculated in 

step 306. After tbe ealetlaM^^ 

lOtlSI ^ p^gr^ is capahie of generation as 

shovm in FIG, 15. fe step 400. two reside imoib of Che flexible loop of the protein and 
the accuracy are hspjited* Hie starting data is kqmmd mm&aily or ii:i^y-b&:ie^mtd[^Qm- 
a database that is well imo^ ^ used by those of skill m the art ¥tom the input data, 
dihedral angles t|r -®) raadamly geiierMed iri ^p 491 and angles are assigned to each 
residue of the hacfehone, J Ehe generated dihedral ahgies are used to geiaerate a bae&bo&e 
alonis or maiMGhahi atoms* The distaiiee pairs are checlced In step 403. It is iiopcMtant to 
detes^iae fee distance batween the two Ca atoms and N and C termiBals of the ooti&rnier» 
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In st^p 40% the dMmee between M by aheririg 

or modifying fee dihedral angles, Step 495 xequix^s that the immliaess of the coiifemier be 
same m the cutting parts of the target protein, der Waals oheefc is pear fermed in step 406 
of the maku^mm atom pairs. If the van der % then the cooforaiers 

ar^- aligned to the target protem in step 407. Van der Waals cheek is periom^ed in step 408 of 
the maiBdham and target proteisi atom |>airs, if it is a^Geptable, then rotamers or side chains 
me added to ^ step 400. If tii^ d^r wa^ls clie<ifc is ^£^eptal?Iie for each 

rotamer hi step 409, then mfonnation is written to a gle in step 41 0 , 

|010©] For the disidfide bond modelmg module of the program is Illustrated 

in FIG, 16, In step 500, coordinates of M 5 Ccx and C|3 of the two cysteines are obtained. 
Mext. in step 501. a distance eheefe is perffemied im Ca to €!a am! GB to Gh, If the distance is 
not acceptable, ffim ol^^ step 500. If the dktasiee is 

acceptable, thmi fee is generated on the eirele fermed hy the rota&osx along Ca-Gp boM is 
step 502, Next, bond l^gth, hond s&gl^ If 
the measurements in step 503 &m aeeeptabie, fe#sidi^ 
are written to a fite in step 504. 

[Oil ?! The; biisdmg: test i^K>dnle of the MPMG33 program is illustrated in FIOv 17. 
Step 600, requires pdb coordinates of the gmmM&& Qmifhmwm and the crystal stmot^re, 
segment: of sequence both fcr best aligmnents^ a^d three options for 

test A l?indmg ? \ Once all fee Infeoiialion is gaCfeered, tile eoiifbiiner is aligned to the 
eoiresponding peptide crystal structure in step 6Q i >, Mmt the root mean square deviation 
betweei) each modeled eonforiner and tM> target : ;pep€de is determined and the average of 
eonfbn^aiioiial angle dif&renee betweep ea&h residue of the two coiybrmers is detennioed, 
if the values are •.a^eeptab^ check of eaeh con&rmer with: the protein is 

performed in step 603 , If the van der wa&Is check is acceptable, then the SAS~b&se energy 
&r eaeb ^ and the statistics are prethmied in step 695. 

Examples 

T&e foop etosHre rate for the cyelic peptides CXC and GXSC 

pilSJ Conftenatioiial ensembles for a series of and €^C, where X is one of 
the smip acids Ma, ¥al y Pro and C3y 3 generated. Each ensemble consisted of 4CJ00 
coBformers that can t)im a dis^lUcls hotid* The side chain was added to the haokfcoiie for 
each cmx&yrxmr. Gm^y Hie fe&cl^ were gyrated ia each 

coii&rnier, 
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[01:1:9 J .The dishlS<fe ho&d lo<^ closti^^^^ m Mj!s^ 

where N c is the number of conformers that can potentially fomi disulfide bond and N c is the 
lumber of cm^bmiem that am not form a disulfide bond but hm® passed van der ^aals 
check TMs deiiiiticM i s siMIar to that of rates 
of loop closure from the medelng and experimental data for ea^b typa of pejstide. The 
relative values from the modeliBg dopist ihe seme pattern as the experimental data. The rata 
for CFC is the highest m fee series CTPC 'as has the iiUhe CXXC series. 

To directly compare both values* ih^ to the same level. The common scale 

factor fer CXC md CXXC can be defined by K^p^|>)/SfMod) ? wher© ECE&p) is the 
s\rnimat!oB of all the ex|?OTBi0r!Jal rallies, and 2(Mod) is tte sinxnnatiou of all modeled 
values. Each Individual value of the modeled eon&imer i^ by the s&ale factor $L 

FIG^ 6 gives the conipaii$oB: of scaled values of modelmg with the e^ values. The 

values; are m B^mm^^'^-^M^ series, 

{01201 Table 4 sfaows ft series are much lower 

than those for CXXC series. T^ laeto% 

cysteines femi disulfide hand? the 

distance Cj-€j. h^meotx iiie two ;'€& atom of cysteine 1 and j must he within 4.0 to 7.0 A and 
the distance C^Gg* between the tw0 atoms iroat be within 33 to 4.7A. Tho imcntms 
surveyed all of the cm^bmwr$ that passed van d^r Waals check and found that for the CXC 
series the avenge distance for the two Cas was about 6,2-6.5 A a&d for the two C£s tfe 
distance was ahout The distaiioe of CFC was the shorted The averaged Cp 

distanoe of the randomly generated oon&rm^^ the suitable distance. Due to ,GXC 

only having three residues, the degree of flexibility- of the backbone is not high enough to 
make the C|5 distance shorter mnfcss tho stmdard bond angles md homd lengths cha^ige. The 
ratio N CBC b/^vdW* where N cacb is the msniher of corrfbiineiB that have seitafcle Ca and Cp 
distances and Mym i^ the total Bmhber of eon&rmor thai passed van dor Waals oheck, is 
0.72%, 0,63% ? IA5% and 0.23% for CAC, CVQ CPC and CGC respeotively. For the 
CXXC series, the average distance Tor the two Gas and the Gps are respectively 8 AS J7A 
and 9AS£^ with CPPC hemg the shortest. These distances are &rther away ffom the 
siasdard distaooes for forming disulfide bonds* hut the residues have a muoh higher degree of 
flexibility for haekhqm A higher pereentage of Get and Cp 

distances. The raiio M^M^m gives 2,19%, 2.61%, 0,53% and L26% respectively fer 
CAAC, €¥VC, CFPC ami CGGC. 
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{83211 To t>mi - a- disulfide fed it is not jmt kpriBt to imw a siiitalsle Ga and €p 
distance, but also for the two sulfers to have good sterie positions. The inventors checked the 
number of con&rmers that satisfied the Ca mM Cfj di^armes arid tile nnmher of oonfbrm Mm 
that formed a disulfide bond, M$Mmg& the eoido^ Ca and Cp 

distances, the probability of fbxinfeg a diMtfide hand is still smaller than CA&G because of 
CAC lacking a set of suitable geometrical parameters such as the bond length S-S, band angle 
Cp~S~S and the torsion angle Cp-S-S-Gp. The ratio l^/M^, where N c is the ntsmber of 
coBibrmers thateaii &nn disulfide bonds^ jfor the CA.C 7 C¥C 7 GPC and CGC series are? 5, 7%* 
4. 6%, 6.0% and 8:4% and to ratio H^H«wb. & r CAA€, C¥V€, CtPG and CGC3G are 36,0% ? 
35,1%, 343% and33>^ 

cotiforoiers thai have suitable Ca and Gp distances, btit also ha^e a lowar peteexitage of the 
cQ^iormers in which the^t^ snifars to have good geometrical positions to fonn a disnMde 
bond- These factors led to a lower 



| Amino 
1 Acids 


cxc 


GXXC | 


Mod 


Exp 


Mod 


Exp 


Ala 


0.091 


3.1 


0.604 


71.0 | 


Yal 


0.066 


1.6 


0.748 


71.0 | 


| Pro 


0,129 


7.2: 


0.270 


20.0 | 


Giy 


0.093 


1.5 


0.357 


31.0 1 



Table 4. The probability from modeling (labeled by Mod) Is defined as MJM^ where M c is 
die 4000 <mdbrmers that can fornix S.^S bond and N 0 is the mmfeer : of cottf oiiner^ that do 
not form SS bond but have passed irdW check. The eqMlibriinn constant Oaheled by Exp) 
is defined as fc/ko ? where k c is the loop closing and k t> the loop opening rate constant (Zhang 
and Snyder, 1989), 

|ISi22f of loop clomre S>r fee peptides haviixg the same 

nmiiber of i%sidnes, tbe.;|^a^^: : -jS&ar CPC is the largest and fbr CVC is ^nallest for the 
CXC series. CM the conh^ry> the probability for GFPC is the smallest and for CWG the 
largest for the series This caused by a ^ed aJ property of Pro and The ratios 

of 'Hc/Ncte^ fo^ the two series are about the same, so only the ratios of H^^b/N^w arts 
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sigmficaiitly diiia^eM, Due to the py^IMte xfflg, the tries to,-.^!1:r®fi--;t»?o cysteines 

toward eae& ofiier, so thM the two vysmk^ a better Cp--C^ distance. It has a 

Mgte ehanec of loirakg a diMSde feoM tliasi otfe series. In fact, die eomfeiB^loB of 
XPX, wfea X is a Bon-prolMe amiBO a^i<i easily form p feim. ©n the ofer hand, Va! 
Has three rotaM^s ^#s-^ '»i : :: F©r :: the>G¥S peptide, t&e rotamer with % l =180 
tries to pusli the two cysteines toward aaeh etli^lj^ tfee rotamers, wifli x 1 - -60° and ©0% try 
to push the two cysteines a^m^ Som eacfi oiier ? tsNe»aii&jE& /iibe-'-iwdf ""feeeaaelies-- a*^---atieffi@st : - 
p^endicuiar to tkm b&cl&one, M is more hIMy that ffie CB feraiich will push the two 
cysteines away torn each lias ■ : ibe:: -sxa^feSEai- ;l£k)|>. =ofe^aEra. 

As for the eonfermers of €XXC $ the Se^^ role for the 

loop elosism SiMe the ha^ has a siiiieh lo wer other peptides m 

the GXXG series, tte loop elosafe GgPG is also lower thaa for other members 

of the CXX€ seri es. 

101231 Hie inventors determined how many eonfefmers were needed to get a 
meapiijgM ratio Tim ratio co^erges to a stable ?ate as the number of eoafotmers M 

the ensemMe is iisereased, FIG,? sfeows the iratio olxanges with increasiiig mimbers of 
go&fbnn^^ ttie ensemble, the 

Ouctuatioa of tie mios is As fie^ the ratio M G M<> 

converges to a. stable value. l&^fo^L1iie : eoiiv^rged ratio may coBiparad with the 

of sonfomiers* One thousand eonfbtmers in each ensemhie is e&migli co^lcHmers to gat a 
coswerged ratio. Tiie fluctuation after 1009 is iiot target than 0.003% for fee CXC series mid 
not larger thaii 0,05% for the series, 

Example? :■ . 
'Eke teBger GJC^C polypeptide 

PI24| Ziiang aiM Snyder (1989) also i&east^^ constant for the 

series of C^M^ to 5, It was ; ;^?md:.t|^|:## Mo o^nstaBt decreases m the 

order of CAA CA4C, CA 3 C S CA 5 C 5 CAiC, with an even mincers of M Mgh and odd 
numbers low (see line 3 of FIG. 8> The result of the inventors modeling using only the van 
c |er Waals ^pro^hnatioB does not agree with EfiaBg and Snyder's experimental 
results, Wzm inventors m^e&se the naniher oC alarifees betwoert tli^ two oysteiiies^ the 
probability as defined hy 8), Hie 
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peak fer C/uC was B0t eaptured hy &e modeliiig. This m not -mipdsmg because the 
in^enMr's calculation only considers geometrical factors, wbiie the mfea^ 
interactions are more oompli^Med in fee ©xperime^f . ffle teentars informed a survey for 
the N-H...O hydrogen bond which was limited only to the haeidsone. The criteria for 
forming the H tad are 120° < 9 < 180° :fer ' .fce ;M-M^..O hm&-m^--m&--4&&& for the 
distance between H and O atoms. The T0tm Bimm^m^ of hj^^gen b<mds in the ci^tiifide 
bond closed, coBfOpa&s divided by the total ninnber of ccmi^rmer decrease in the order of 
CA 2 C, CA4C 0 A S C, eAsG, €A,v€ 5 ; is similar to that of constant (see line I of PICKS for 
ratio of H bomis). This iiidic>ates are Savored to h we 

H bonds that stafeihm the stnicture, 

f0125] The solvent accessible surface (SAS) energy AG was calculated. Since the 
hydrogen bond is not eo^sidared iB eaisml athig AG, condensation was given to the energy, 
Vm each H bond, the energy is Increased G>5 mats. The energy weighted prohability is 
defined as 




wMeb is the ratio of partition fenction lor the closed peptides divided by the partitioo for all 

imelosed peptides. This ratio Ibllow 
•FIG^S). ::S : ome ,eoBfoifner$:M the ensemble that can ,fem : SB bond have a Mgh pipbability 3 
which leads to a high ratio. The pmk of GA4S is slightly Ja^er than CAjC and GAsG. 

CoBsf MctioM of peptides for moielmg the pepttS^slreptavMlii c®snpl£& 

|6126| lhe backbone dihedral angles of the peptide: were ta&domly generated in 
the four Ramaehandran maps,, one for glyci^ one for proiine, one for the Q3~braneh€d 
amino acids {VAL V .ILE and TIM), and one lor all olher aniioa acids. The tram aod eis §>nm 
of the pef tides have dihedral angles of SO 0 and 0* with a small random deviation (usu ally 
witMo &S*% The feao&boiie of peptide &n fee dihedral angies sj/, 

o>) and the st^idard bond lengths and hohd angles. Bonder and EieKard r s rotanaer library 

was used to add side ehains to fee bacEbone, The single hard sphere ai^oxiniatioB 
was usee! to eBnhnate the grossly h^rGhable eo of as a hard 

sphere with ^ radius. The nrfmrninn distaii^es pijiiBa e$ aL f 1 987) 

between two atoms were used for the van der Waals check tm each atom pair. These 
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distances are about .;fi&-:& 3 4A. 'sJbcaaESe*" : t&a^-":^ye '^ri&aF et at, 

(1963). K are ge&etated, the overlap of the H 

atom will otfeer atoms is even larger faSx^i wobM not fee 

efficient to generate the corfbrrrier Mm to vax ier ifel ^i^jati^HSi. 

Modeling ctls&ISIcIe fo ®b cfe Mt t&e pepficle-strept^lciisi. complex 

|8i2f| A single dis^lflie bond can be modetec! losing the method of the present 
iBventiotL Wlt^B tw& ilfe^lfide bonds are be paid to: 

the eeiraputatioiiai ..effteie^ form two disulfide bonds slnMtaBeousIy for 

a polypeptide is the produet of the probabilities for eaeh disulfide bo&d to form* Gim>mtly ? it 
takes a long time to generate ooe peptide with two disulfide bonds. The inventors have 
created an efBcieni way to mpdd a t^*o^ With twoniisutfide feondpd 

loops, the short one m modeled & 

forms a disulfide bond, Wbeo the first loop is fixed, it may take teg time to find the second 
loop if the first k>op does not have mitafele gecMotry, llrereiote, some rnimber of tries must 
he given to search for the seeotM loop, white the coBfbimaMoii of the first loop Is fixed. The 
number of tries is Bsnaliy set to he between, dbotit. • 5 :: -&h4 10, It is possihle to -obtain several 
polypeptides mth one fixed ooBfomiatfen for the first loop a&d various oon&rmaMons for the 
second loop. Ail c<m§mmz$ in the en^einhle are kept fer the Abiding" test 

fil^SJ If the polypept^ 0'e, the nitrogen (N) 

of the first residue niafees a cowlent bond, with the carhon {€) of the last residue) Ihe method 
for modeling the disulfide bond Is no longer valid, r E he criteria to form s^ek cyclic peptides 
are I 35±<L6 A lor the M-C boM^fe the bond Mgles (0ArN-C -orCA-C- 

H), It is less efScieot to geaer&ie strfi oyelie peptides than to generate a one disnHide-boBded 
peptide, sirtce the former is $mt£li$& . m&y fey mm position of the atom N or C 7 whereas the 
latfer is searched from aBiiihher of positions of sui&r. 

Example 18 

Allgs^mg t&« ecmformers to ike MBdmg site of sireptavMtn 

P129] After msssnhles of coafermers were generated* the 4i: biMing 59 test was 
performed. The first step is to align the confbtmer to tha template. The template is the 
peptide in the eo-crystal stmefera complex. The second step is to screen the eoBfernier by 
using the hard sphere potential modei For the peptide-streptavMin eompieK, the dominant 
binding force occurs at the HPQ sequence of the peptide, fee raodeled eoEfomiers were 
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aligned io the correspoj^Bg HJ?Q sepgooe of ifee crystal stooture of the roMpie^ Any 
higher msolotlom X-ray crystal structiire cm be tisecl for the template, Two criteria used 
to detenni&e ^ alignment is sucee^fei One criterion is the root mean square 

daviatidB. .(misd) between a&dx modeled e^ i 
R.msd(k 5 t) « "Cgir^C OMfe J>a^,j9^ Cz(k s j)^(t ? |}f 

where b is the mimber of atoms partidpatiiig l^ ffi$^^.{wH) for Ike HPQ 

pi3©| Aether criterion is 1^ angle dtSM^ 

each residue of the twa miteis, 

AA(k, t) - 2^i*ii*^6 -mm Hw^-ws 

where m is ^ (m~~3 for the HFQ 

sequence), 

101311 To deteonme whether the aligtoi^t is aceeptpfete or. not,, two common 
reference values zmsd^f and AA*^ for ni*sd(k s t) m& . t) respectively are givan. -.For the 
ktfa con&rmers m the es^es&M : £> if nnsd^fe, t|<m^sd re f and AA(fc iys&A^ are satisfied, thea 
this alignment is acceptable. If any one of fee criteria is not satisfied the Migmpent is 
unacceptable and the eoiiforiBer will be rejected^ For f he HPQ se^enee, fee reference values 
are rmsd^OSOA (Three atoms Ca, € and N were used for fee aligmiient for each residue.) 
and AA rs r : 5f}°. 

|0132] If the aligBmmt is aeceptable^ a van der Waals cheek willi steeplavidiii is 
performed as the second : fctep to determine whether or not the imal doekiag is seeeessfiiL If 
there are any eoIHskxos ." '£p^ . thj& -aS%Aii ; 3paji^'-;<>f otO*afi»aQM* the target proteM, the docking is 
not successful and. th& confomier is rejected, The -atom.radms for van dor Waals eheek is the 
same as. those tBmtipried hefore. If there;: is : : no vm der Waals violation for any atom, pair* fee 
convener is co^Mas^ doeiksd into the -pmteift*. ' w bMding; ratio^ 

can be defined as the ratio Nv'Ht, where Hb is. the immber of conforms that can be 
saccess&IIy docked into the HPQ hfediBg: pocket and N t is the total number of the 
confboners in the eBseniMa The ratio correlated well with the experim^taliy measured 
binding alSnity of the pomplex. 
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Example 11 
Ouster 'sfi&fysis of the HPQ secpe&ee 

Pi33] The imrmtms hm® surveyed tlie p^tide-strepta^din complex. Table 5 lists 
the peptides aM^e strepia^din. Ensembles of 

eonforniers for all these pepides h^ following the above procedures. FIGv 

10 giyes an example of the e^emfele for the peptide of The HPQ 

sequence of the peptide is crucial for binding so it is . necessary to imow what firactioa of the 
modeled eonfomiers can adopt a type-I p to fe ie HPQ sequence. The crystal structure of 
CCSPQC^^^VBEG (FIG. 9), determined at resolMJoii 1 .46 A, was used as the tenipl&te to 
calculate that fraction.. All the modeled conformers axe aligned to the HPQ sequence of the 
crystal smr^otiare CC5IP<3GGMVEEC r using the refetenee values mmd m r s Q,S0A and 
AA^rSO 0 , For each oonfornie^ if the calculated ifflsdCfe, f) axid AA(fe^t) are both less than the 
given t^fcrexice values, the cosifermer is said to he '^Q-like 3 ', or it is similar to the 
crystalstmeture in the KPQ sequence. In otter words, the modeled HPQ sequence oaii adopt a 
typs-I turn. 1M : p^rceBtag^: o£ coMomlfets sMe tesatMy the eriteria is :listed in Table 5. 

|S134] The BPQ~!ike coBfermer fer the linear peptides (around 0ty about 2-7 
times smaller than the peptides with disiaSlde boBd |12%-42%| (^afole J),, me reason Is that 
the linear peptides are not restrained hi e-oBfonBationai : space arid - can : accept various 
coiifoxniatioiis. Whereas, for the peptides wife a #$uMde bond, the configuration is 
oonstraieed. The HPQ>Ji&e ratio for the '^^-^^Sm^oes- not wy much. The ratio for the 
: .0Ychp: peptides varies according to the type and immber of amino acids between the two 
cysteines. The only diSfer^ce between the confbrmer i^GfiPQFMCJIECJMS: and 
^iCHPQI^CIESRX is at residue 8, Bia= both ha^e a significant differ^ice iti 
which the former has a ratio of 22,4% and the latter has a ratio of 4X,9%, Having a proline as 
residue 8 greatly increased the chance to $mm a typ e~I f turn for the TIPQ xnotlf; 
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womm$m: 





Observed 




Peptide- 




K A 1 


HPQ-tike 
(%) 


Binder (:!%) 






0 (KM 


4 


1.3 


FSHPQNT* 


125 


0 008 


6 


1.4 


AMFQm&EK 4 


136 


0 807 


6 


1.50 




904 


Q 095 

\> v \Jp \J ^ 


6 


9 S5 


AECHFQGFFCIEGSJC. 2 


0 21 




24 9 


"11 '3. ~~ " 






1 08 




28 7 ! 

- 




7 


0 14 


22 4 


113 




0 47 


2 13 


16 1 


6 6 


cyclo-OTFCpi^ 




3 70 


-t. v. * * / 




cycIo-CIIFQCTFC? 


0.67 


1.49 


23.5 


12. 0 


cyclo-CAHFQFFAE^ 


0.13 


7.69 


21.4 


21 


CTcIo-CAilFQFGAi:)^ 




0.05 


12 


7.1 














! 1.3 




27.S 


7.5 


RGGHFQCCSMAEE^ 


; 2.3 


0.45 
____ 


25.3 


7.2 


1: RCCHPQFEPeMGC 3 


0.33 









The list of experimentally observed binding and die modeled "bisd ratio" ib. 

1. Weber etal f (1992) Bioebemlstry 31 5 9350-9354. 

2. <3iebel er a/., $995) Biochemistry 34, 15340-15435. 

3. Sc&midt et al, (1996) J. MoL Biol. 255, 735-766. 

4. Zang e* £?/... (1998) Bioerganie & Medical Gliemistry Letters S, 2327-2332. 
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i&lMl In the peptides, C03F^C3CS&f^EBG and GGKPQeSM AEEC the fmt two 
cysteines too close to ea<?fe otbar to form a dss*dfide bond. The ca&3hhmliGn& o f the 
disulfide bonds that can be formed ate thg cussed iomi Ql^G& y C^-Gl2^ Bnd iherissted fi)rm 
G3-€6, -Gl-Cl-2; Hife crossed Iohe lias a higher p e^ coiilormers tfa^ t&e 

nested form TMs was caxisM by ifee smaller loop. Zhang and Snydat p 989) showed the 
eqmlibri^m eonstaM^ fer forming CXXXXC. The first 

loop in GGHPQC far the crossed fon^ higte ratio of iype-I J3 turn in the 

seq^esce. Wbm Ala is replaced by *¥al tor peptides wife two-ioop% tho fe^ti^s of MPQ-iifce 
coBfbrmers increase. The ©K tei^ liiiiits; fee ooiiformation of the 

HPQ secpence, winch en^^ of i^PQ-Kke coiiformers. 



|lf 3i| T&e X-ray ■m-mystal stmciuze sfes^ tie peptides bind to 

atrepMvMin at the same site. The 1S?Q sequence is crucial for tte biisdiiig af tii© complex. 
When the HPQ motif of tile modeled cc^rmers is s&miar to thai of the corresponding 



""binder", I^^ : large? the faction of '* ^Mader 5 in tte ens^ the binding affinity 

is for the complex TC^ p^oep|age of *%iiid^ ?> in the 

ensembles. The fraotiosi of '%indM" correlates with the expeiimeBtally mmmmd binding 
affinity for tlie series of peptides. The linear peptides are adopted by streptavidm at very low 
percentage (from 0,SS$& to lul^S) eoiiipared i: mth : ;ffie-' : cycHc or disulfide bonded peptides 
(from 7% to 28 J%% Tl^ measured hM lower 
fhmi the other peptides. This is caused fey the entropy effect The linear peptides are not 
cepsiramed in co^rmatiotial space and lose more entropy wfeeii they bind to the target- 
protein. Therefore, tlie measured .binding affinity and. calculated- "binder* fraction for the 
linear peptides is very km* 

pl37]^ in Tahle 3 were selected from a phage display 

Sferary. There are The eonfoimatic^ is more restrteted 

than the peptides with one disulfide, it may be reasonable to e&peet mi even higher affinity 
than the cyolio peptides hecaase the eonfbiination is more restricted by the two disulfide 
bonds , %e in^ is aete&Hy less ihm that of peptides. 



The hlndiiig ratio 5 * of p^ptld^-strepta^idlB coMgie^ 
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The modeled Sractidti of ^itidef * aiso behaves like the me&si&ed affinity. This may Be 
caused by the geometry of the tiding site fer iis system. ^.ItlioBgli the peptide is more rigid 
aad has a higher IraGiioii of IffQ^ite €<3Ji&miers ? &e cfeaace to collide with str^ia^Mii is 
teller feaesuise the mMpiM^n Is too 1 arge to properly lit tJie ermK^ at the binding site. 
The penalty from the collision is e^eo greater ifean advantage fern rigidity of the 
peptides. The iBnnfeer of thnes that eaeh reside collided with strepta^idin mm co^Hted, 
assuming feat each atom on the peptide collides with sfeeptivtdM oniy one time, PIG. 11 
stews the nmnber of c^ffisicms for each residue for the two disuifi<le-bojided peptides. The 
second loop containMg residues 7-11 (QMVEE) oollides with streptavidi^ than 
other residues., 

Ex&n&pte 13 

Tie correlMlon ffee m£8smr^ ratio 95 

P138] The difference of Gibbafe fbr li|p^ds to fe 

written as AG*^-^ R is a constant 

and T is the tern^ assniiied to have a lM delation with 

modeled **&m energy^ AG? - m*AG c +■ % where ik€^-^llJ*l4i[i^1. -4)3* ft is #e modeled 
"hinding" feetloiiv The slope in and the itiferoept b can be dMermined by^^ 
snmmatio^^^ AG^) 2 . wh&te M is the total iwmher of the 

peptide listed m Tab!© 3 . FIG. 12 shows the eorreiatioh df fte ^tedisig r^tio^ with the 
observed The straight Imo % Res 

Example 14 

. |M39] Compomlds are being developed to kM&& atketeetit and/or replication of 
alphaVimses, flmwiruses and arenaviruses. These; pathogeihe RNA viruses are potential 
biolegical. weapons and. are of general Hiedieal eoBoeni. Mouse brain membrane receptor 
preparations are used to select Langat virus variants that do hdt bind. The E protein genes of 
these vaxiaiits are sequenced to Sn& xmtated regions that identify for 
binding. The reeombiaant protein are expressed and sheeted to X-ray erystaliographic 
structure determination. The cell receptor is also idei3iiSed at this -fey screening a dDNA 
library for binding to the Langat E protein with Mndfeig detected by Imtm^ 
Candidate cDMAs will bo screened fiirfher to identify open reading franies. The pntative 
receptor will be expressed in $& calls. The eel receptor's id^itity Will be conlrmed by the 
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ability of L&ngat to infect tra&sfected cells. The domain of the t^sngat E protein eorri^iiiimg 
the site of receptor material for phage display 

screaiiitig, Bli&ge display tactool&gy is used to ideMtfy to^iB -based cDmpoiimis thai bind 
tightly to domain JB:.^^^:1^;M:^^^: and iiMe^ subsec|iieiit viral 

entry into the cell* P^emimat^ti of the str^tpre of the cell receptor allows additional 
templates for phage display to be ocmslmoted* Identified CQmpooiids will be tested for anti- 
L&iigat activity in Vera cells, then in t&e iBcnse model by kUniperitoneal and aerosol 
challenge, 

£0I40JXhe mventors have detemiitxed that spiperone, a doparMne X>2 sisfetype 
receptor antagonist^ competes with Japanese encepiialitis virus i^r feindltig to mouse brain 
MSB (in^bmne receptor T oxin-based anti-vkai compounds are being 
designed based on families of I0-4S residue dis^lfide-rieh eoni&rraatioiml constrained toxins 
iachidMg apamiii* tertiapln y semioiOMii and eonotoxins, aisd the bnman hormone sndotlioEn, 
Oo&strained peptide loops and i^re agid^^ being rised because the 

stmetumi restrdnts allow the re<feetk>n of eo^ ripen binding and thus 

increase the affinity of binding, extend the eompoimdV bioavailability by reducing- its 
sensitivity to protease; in the sermn and increase ffie .speeMcity of interaction for a single 
target by eSmioMing con&nn^icms that might bind to Mmm pmteins, The optimization of 
toxin analog sequences cam fee rationally guide by an MMR solution stmture detennmatioB to 
identify the changes in oonfemati^n and dynamics. Because phage display technology is 
used, onee a sequence is identified as effective as an aMi-viral componnd, variants can be 
quxclly optimised against related L&ngat E proteins and envelope proteins of sindlar ^irases. 
While the rigid scaffold of the fehss is adapted by fee iiiMMtore, the sequences. Identified 
differ greatly from tbst of tlie wild-type toxi%. elbmBa&ig any : intniisie toxicity. The use of 
disulfide bridged loop peptides m& structured l^xm-bas^d- libraries restricts the 
eonfoiHsatioiial. space sampled by each sequence. 

|@14i| A rational structiira-based iBoreHienta! approach is pursued in parallel with 
strict Mind cambinateial Methodology . Phage display libraries containing random octam 
sequence constrained at thrij ends with a disulfide bond are prepared. Tight-binding loop 
peptides are sjmtfeesized and tested fe^ of viral entry. The costal structure of 

mhiMtory loop peptides in eosnpfe with fee E protein is determined. Using MFMOB, a 
compact folded steotBre is designed to stabilize the observed to^ That 
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peptide is spi&esissed and tested for Biiiding and mMMtory effects on viral Infection 
BiBdmg^te by tpro£^ 

[01 42] ^yitiviral agents are screwed in vitro a cell culture assay. Monkey kidney 
Vero ceil eidMres are pi^treafea wifli different maeeiifmtioiis: of the test ag&xit fcefere 
iBfbolioa witli various diMtloiis of Lmgat yot felted with Laiigat, cells 

are overlaid wife agar coBtEBiiiig the; test ; age&t at tlie same eom^itraliDtt. CMtures are 
znm&MM and subsequently stained to q^aBl% viras paipe ;:fesinatioii in age&Mreatcd vs. 
Bioek-treated eiitareSv Any agent tfeai reduces virus pkcpe fomiatioii fey 90% or greater is 
studied fertlier. 

f$143J >&* vrw model s&Mie^o^ Swiss mice treated with tte 

test age&l one day befot^ at and on eaek of four day Soiiowmg Ixmigat virus elialleiige. 
Bif&rent concsnti^e^s m& routes (iMraperitoneal aisd iBtraaasal) of agent adminlstrati^ii 
are e&amni £d wit!* iiitrap^tOBsal vims c&slenge. Mean day of dead* of micg is eoropared 
with m^ek-treated mise ax*d determined effieaey of fee test agents. Any potential age&i is 
tested further &j its Msil^ 
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imtmded as Ilim seope, Olimges tfeereiit and &Mm uses will occur to those 

skilled m fee art wMeti ^ of the iiwerdmii or defined by the 

ssope of the pmdmg claims* 
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We claim; 

L A compiler^ feriise i& 

the steps of: 

L genet&tiBg^^ angles from a set of protein sequence data; 

ii g^n^ratiTig a pi^tem fo&c&^ 

Hi; performing a w& Wa&fe eaiettlatiori of the proteiii teas^^one; 

iv. calculating a solvent accessible smi ace based energy of coMbifflers thai are 
generated m Steps : -HM;. 

v. niode&g disiriS^ feoiiiis in the protein backbone; 

vi. periormmg^ 

vii calculating a mteot a^eessibie siirface bas^i energy of confenoers ilmt are 
generated M steps wi; and 

viii creating the modMed protein with stnlctfcu^/ feiiii m tle above 



2. The method : of elaii^ stability ; 

3 ; The Method of claim pairs of tlie 

disulfide feonds- 

4.,. The method of claim- 1* wheireiri^ f or 

5, Hie method of claim I , ferther c<^mpiismg .detennijinig fee nmBfeer of co^fbrmers that 
are able to form disulfide bonds, 

6, The method of claim l # further con^pising adding ratamers to emh residue in the 
protein bMkbons, 

7, The method of claim 1 y ilirther cotnprisiiig periomiiiig a binding test for each 
coBfeon^r with avten^late.Moiee^le,: 

8, The method of claim 1, fisrther comprxs&g ealcnl&tmg the rate of disulfide bond loop 
closure* 

9, The method of telmm 8 , wh ai-ehi determiifflig the rate of disulfide hond loop closure in 
the protein comprises the steps of 

L performing a vaB der Waais calculation on a multiplicity of eon&jrmers of the 
protefeM^ those confteiers that can not fbiffi an intratiioleenlar disulfide to 

yipM an ensemble of N* stoically allowed eonfemi€?rs: 
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ii anafysdag the eage^bie of stericaliy allowed corfomters to yield an e&se$#>Ie 
of He confbnners tfeat are capable offotmmg m mit^i&l^Mst ^iMMde bond; md 

iii calculating the raio M0$x which represents the rate of disnlScfe boiad loop 
closure iii fee peptide, 

10 . A method of pr otehi mimatim£^oB. oo^iprismg m o delmg a j^roteia to have the 
B&eessaty active site oo&formM^^ 1 while rediieixig the 

total Bimber of amiao acids m the pmtehx. 

i t , A method of iiicreasiiig binding atfhiity betwe^ by 
decreasing the cxmibmiahonal exatropy loss upon Miid&ig hy te p^tem coiBpxisitig 

eoii&rmatioBal space mmg the mettod of claim 1> 

12. A compm^r-readable storage m^i^ 
e&eeMes the steps of cl&im 1 

13. A eomp^fer^eadahle sto^geimedimi-lia^g . stored therein: a -software f rogtanx ^rMd;i. 
executes the steps of ckixn 9. 

14. A iiK>difi€di5roteiii with Increased binding afBnity produced by the method 
comprising the steps of: 

L perfoimhsg the steps of the method of claim ,1 ; apd 

ii. performing a binding test for each conformer with a teoiplate moieeisle; and 

iii. creating ^ characteristics foimd in the 
abov^e steps to Micrsase /Mtidrng a^Bity m. the modlBe4 -ptotein, 

15- A modified^£HX>^ steps of: 

i perfbrmmg the steps of the method of claim 1 ; and 

ii> creating the modi fied protein using stmeteal chatacteristics fouM in the 
above steps to iiser ease stability of the i^diSed pxoie^^ 

I C A coropnier-assis^ a protein composing the steps 

L generating random ec^^ in allowed region of Ramachaedraa 

maps fe>m a set ofpr-oteM seqiienee data; 

ii. generatmg a protmn backhoBe using said coBfbtmatioiiai angles; 
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in, psrffeiming van der Waals C33c;idatio& for esoh backbone atom: 

tv, dafemimiiig disulfide 

v, adding a rotaiiier to resicfees in ibe backbone; 

vL perfonnfeg Ym der Waals caiculalion for rotamers: 

vii- performing biMmg test with & /tedajpiate. protein; 

vili calculating solvent accessible surface based energy of all confbmiars thai are 
generated in steps i-vii; and 

ix fcreats^ the 
above steps. 

17- Tlxe me&)d of <5lai3^t ; l^-^y^ssre^ gpner&fi^ eompd.ses 

IB. TKe metfaoci of claim 16, fiirtli^ atoms, 

19, llie method $f <4a&& 16, father eompftsmg detei^itHBg coordinate pairs of fee 
distiliide fends. 

20, f Ilis nMMod ofctmm 1%*^^ K> € A a orC 

21, The rsstfaod of Mmm 16 > wh^^#0 eosi^ or <0- 

22, A coir^^er^r^adafole storage medium iiavrng stored therein a soft ware program wMcb 
executes fiie steps of cMffi t6, 

23, Aeomprst^-^ 

t geoei^SiBg random eorifim^atlOBSi angles hi allowed region of R^masbancfem 
maps from a set of pi^>teie seqiiepee daia; 

ii genef atmg a prpi^in feael^osie using said QOBfbrtaatloiial a&gles* 

iil a£M&!g a ratainer to residues in tiie backbone; 

iv» cMs^iBirig dis^ 

v. Imfemg method to a eoiniMter assisted program Qaat calculates Imear 
conlfermers; 
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v. - calculating solvent a&eessiMe st^face hased energy of t^^&iiners thai are 
generated in siep$ and 

vi. creating tte 
above steps. 

24, The method of claim .23* wherem the eompnter assisted program feat calculates linear 
coiifpmiers is COREX, 

25, The method, of elwi. .23 wherein .fee. eoiifomiatlom! axigies are 4*H*£ orm 

26, A cqmp^ep^ 

executes the steps of claim 2 3 . 
27V A method &r detemiim^ 

cdmpriamg at least erne t^o-eysleim where n is an 

integer* fee method comprising fee steps of: 

L perfasmsi% on a nrnMpiieity of oOHfbrmers of the 

protein and subtracting those zonftmrnss feat can ;|iot:fdBii' an intimiioieeiilar disulfide to 
yield ensemble of K<> ;$ferleally :allo ieoMoim^rs;. 

iL analyzing fee eiisemble of sterieally allowed con&mmm to yield an raserohle 
of H c Goiiformers that are eapahle of i^Bfcg an iBtramoleoelar disulficie boiid; and 

iii calculating the ratio Kc/N 0 which represents fee rale of disulfide bond loop 
closure in fee peptide. 

25 .. . The method of claim 27 wherein fe^ rate is compared to fee rate of dissdfide-botMi 
loop elositre of the pt>tem coBtaxumg at least 01^ dilfe^ 

29. The method of claim 27 ihifeer comprisfeg the step of gemmating peptide bacM>one 
coordinates for fee.C-JC^C' itiotif fiom standard boM angles, bond lengths and. 
dihedral angles raMo allowed regions of a Raniachari<fean 
map for each residue to yield fee 

30. The oiethod of claim 29 ferfeer comprismg the step of using a side chain rotamer 
library to generate OX^C side chain coordinates to yield the multiplicity of 
con&mxers of the peptide. 
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3L The method of cMm 27 wliai^ii am allowed C€>tifo3*mers 

cmppri&es e^ufefeg tfeo fr&o mei : gy of '^m;<SG^sm^^h^^^ ijpoft^&e solvent 
accessible surface area. 

32. Wm mp$m4 of elaim ;p m&lymztg ifee stedoally sic? wed cosifo?^^ 
compxlses fiexiMy modeling fce.oysteiiie sids <&aiixs, 

33. "file m^ttod of tiaim 27 ^ mid : 'M Q toy tie 
diifeenee Sp and disnlSie fbmss of the G-JC^C 
inptif s&d caicpl^tiog#se ratio 

which reptesetxis t&e e^ergy^weightfed rale of dfeulfide loop closure m tb.e protein. 

34. fiie tm^^S^i^mm 33 Srtte ooBspisis^^^^ of N c 
eonfoim&rs of the protein feat potentially fen3i ml i^amoieoiilaf disulfide bond. 

35, 1^ confbrm^m to a binding 

site oh a te^ oft 

biomoieeule to yield an ensemble of aligned co^feixi^s; asid 

ii perf^^ ensemble of aligned 

cpiifoimors to yield an emem&te of Nb stetfealiy allowed coMMiiiers tfe&tbi&d ^ 
biomoieeiile, 

36. The x&ethod of claim 35 whsi*ein docking the eiisaBifafc of M c ocmJfcrrsiers to a binding 
site ob & template Momofeoale >^ 

L ahgmng the ensemble of ^ on a template 

biomoiec^le to yMd s&msem^ 

it pm:$mxms & van der Waais oaiculation on the eiisemfok of aligned 
con^rmers to yield an ensemble of Nb sterfcaliy allowed oonfbrmers that hind to the template 
biomolecule. 
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37* The method of clmm 2? wfeatsm the protein fether comprises a plBrshty of two- 
cysteine ihqtifs xpp^mmt^A % C-3C sr C whereM b is W^^ndeHtly an integer for each 
two -cysteine Hiotif 

38, A oor^pistei^readaM stored fierem & softwBr^ pmgrani which 
executes the steps of claim 27. 

39, A matiiod for assessing the feiiielkig aflMity of a potem to a t^sipiate mGl^ie, 
wherein the protein ^Offiprlses at lea^o^ C^rC 
where. ii is an integer the method comprising: 

L docking the ensemble of Mc cosifGrmers to a binding site <m a template 
Momofecule to yield m ensemble oFN^ confbrjne^s that hhid the template biomoleoule; and 

M, cdc^latiiig.lhe ratio N^/H c wMch i$ iBdicatwe of fee Mnding affinity of the 
protein ibr the template hlomolcc&Ie. 

40, Amefeod for assessiiig -&e hh^hig ^ a template inofectile. 
Wherein the proteiB comprises at least one Iwo-eyBteiEe motif repressed fey C-Xjr C 
wham si is ao iiitoger, ths irrf^ of; 

L soreeBing a population of candidate peptides eonsprfssiig at least ose two- 
cysteine Biotif represented by C-IX^ C where rrm^ 
peptide that are capafel© of fiipmg asi h^^ole^ m& 

ii pefionmug fee metMod of claim 39 oa at least one eaiiciMate peptide that are 
capable of footing an hitr^oIecMar <Msiilfi<Je horid M assess the biridrog a^Smty of the 
caan-did&te p ©pffie, : 

4L The method of claim 40 wherein the each aa^dxd&te peptide comprised a preselected 
amino add se^U€3ice: 

42 , The method of claim 41 wherekrthe pt^ the 

43, The method of claims 42 wbierem the desired secoisda^ 

44, A mathod &r modifpiig a protein comprising the steps of: 

L evaluating m X-ray crystal struotiire or a nuclear magnetic resonance solution 
stmeiure comprismg an oxidized mfere^ Bioleo^e, : :: ^^^Ses^&m^ 

proteiE eoiBpdsiBg M le^st one intramolecular disulfide feon4 to itotify at least two amino 
acids at posiitcms disulJide hoi# fermatioBj 
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ih ssibstitufe^ s^ids -M ih^i^t^mptotma to yield 
^ modified pm 

iii icMitifpng an exismifeie o€ co^iforine^s of tip modified protein that can 
potential^ 

iv. docld&gife 

Biomolecpie to yield an ensemble of B3b poiifoii^lers &at Mmd the template bioxnolecuie; 

v. . ealoulatmg tlie ratio H^M C . which is indicative of the binding afSmty of the 
modified protein ^ and 

at digetBiit positions and IdaBti^iag modified peptides wife fee Mghe&t ratios. 

4S> Tlie: method of oMm ^ wh^reiii the ideaitliyiiig an emei^le step c^prise^ the stsps 
of: 

i < Menti iyMg a first eoBfon&er of the protexB that are capable of forming a first 
mtramoieciiiar disulfide bond de&h^ loop; 

n, eon^ramh^ and 

iii identifying a second coB&mxer of the proieiii that are oapahie of lomikig a 
second intramofeoular disulfide bond defining a second longer disulfiSe-bonded loop. 

4& The method of ehntia 44 wherehi a sec^^ sifier about 5 to 

about 1 0 attempts to idMtify saM coiiibrsiier, the Method the steps 

of: 

L elimiitating .t&a first : disulfide bo&d from the model; 

ii idarihfpng a first coii&roi^r of tfee peptide that can potential iy form a first 
mtr^olecBlar disnlfi de Ixmd dcfmiBg a diffepit first dis0lfid©-hoi5ded loop; 

iii, eonstraking the mode! By tte first disMfide bond; and 

v. ideMij^tog a seoortd ^onform^r of the peptide thai can potentially form & 
second mtt^ second longer disidfide-bonded loop, 
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47. A Bsethod for assessing the binding aiSml^ .&£:^ftc^^^.^m^s^- molecule, 
wfei the protein coropises a flexible loop, &a maffiod 

i gei^rallng a peptide eoiifeD^ startiiig residue I and 

iriatobi&g to a target residrte I + N oil fise peptide model; 

iL accepting the loop conformation w&^ti &e deviation between rssidtxe N and 
tiie target reside is small; 

iil> closing ipe feop p^iiig a geometric miMHBi^^ioii method* 

iv. selecting fee rasidis^ coi^imation by fe of claim 29; 

v. Ipncmti^ 

yi estimating &e biojcliiig affimiy by testing the docking of the fell m&ii«pratelii 
e&semfele and peptide taj^icoistal^^ 

48. A proisip, produced by profem mimatiinzatioB eon^rising mode&g & protehx to have 
the Becessary acti ve site conibxiiiatioii losing the Method of claim I while reducing tlie 
total number of amiBD acids m the protein.. 

49. A protein enable of docfeh^ of a portion 
of said protein wais co^fr&ised by the introdiietio^ of a. disttlfide bond by fee method 
of claim L 

50. A protein, created by the method of claim t , having the eteact^istfe of MnbitMg the 
binding of a Yiii*s M^ of a 
toxin and compiisc^ M least mm loop cpttsts^iiied by a disulfide, 

51 . Ab ensemble of iistramoleml^ of said loop from 
the protein of claim 50, 

52- A protein having d^reassd coBfoTOational etrSopy loss upon binding to a template 
molecmie In comparison to the tiaferally occurrmg protein due to the constraint of at 
least one loop of m imstabfe region of a psroteiix M eoHlbrmatiOBal space i>y tbe 
fennation of & disirffide bond otbsr than disulfide bonds foondiii the nafcrally 
occurring protein using the method of claim 1. 

53- An TOsemb^ 
protein of olmm 52, 

54. A protein modiiied by the method of claim 44. 
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55; A eoiB|>iitsr ^t^'fm€^s^Z ^ moM&&^otmn^ said system coir^rismg: 
ix> a database coBtaiaiag a set of px-otelti segixenee data; 

a software progia^i coupleci with said database* Hm mftmt^.pxo^m^.M^pi^d 
for pex^iimag the steps of ; 

(:a) generating random eonllOTiaiianaJ angles from fee set of protein 
sequence data, 

(b) generating a protein backbpBe using tbe eonfbrmatianal angles, 

(c) per&miing a van fa Waals ealcBiatioli of tiie protem backbone* 

(d) calculating a s^veiit acceEsible surface based energy of eonfbrmers 
thai are generated in steps (a) — (c), 

(e) modeling disulfide bonds in the protein backbone; 

(f) perforniiiig a van der Waals calculation ^^^^^ disulfide bonds; 

(g) calculating a solvent accessible surface based energy of confbmiers 
thai axe generated In steps |a) - (f| ; and 

(h) creating 0m modified prolsin wife stmctnral characteristics found 
in the- above steps, 

56. A compiiter syBtaei fer designing 

L a database oontainnig a set of prof em sequence data; 
if. a software program coupled 
for perfbin^g the mm «tf r 

(a> generating randomly ooufbn^atx^nal angles ill allowed region of 
R^aehandran maps ftoni the set of protein sequence data; 

(b) generating a protein backbone using said coi^raiatio^al angles; 

(c) adding a rotainer to residues in tfae backbone; 

(d) determining disulfide bonds in the potein backbone; 

(e) calculating linear ccmibimers; 

(f) caic^lating solvent accessible surface based energy of con&rmers th at 
are generated in step s (a) - (d); aM 

(g) creating the modified protein using structural ebaracteristi es identified 
in ilw above steps. 

57 . The counter ^ an 
externa! program im ealenlaiing confbrniers. 
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Input: sequence, disulfide bond connectivity, and o&er parameters 
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Generate 4> } % m (randomly) 



Generate polypeptide 
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Calculate SAS-bassd energy 
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Disulfide bond maddfcg 
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A flow chariot the program MPPOP 
(tli© fast mocil 
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Input: peptide sequence, disulfide bond Mrmectivity and other 
parameters 



Randomly geaer^te ^, *¥? a> in the allowed regions of the REmacfea^draB 
maps (Gly t Pro, CB -br&nchecl, and the rest of the amino acid) , Assign the 
angles Id each residue of the baclcbone. 



| Generate backbone ^toms H* C starting 1 


rom three given atoms. 
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| Check the CA-CA md CB-C8 distance pairs. CB were added to the 
| CYS sbcS deleted after using ft. 
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C3e^erate the i^est of backbone atoms CB, 0 (KM 
and MCA optioml). Do vdw check with all other 
atoms after each atom is added. 



Modeling the SS bond and mcord the SS coonMnate 
206 S. pairs for Ister (the final} vdw cheek. 



C&nimmes to n ext page 
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Contitmes fr&m previous p&ge 
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Add rotaiBers (four options) ro each residue except 
CYS* Do vdw check after each rotamer (a group of 
atoms) *s added. 



Obtain all the norse- 
vdw-vk>laiioii 
retainer for each 
residue. Obtain all 

fee rotamer, and also 
do rotamer pair vdw 
cheek* 



Writs i&e bacls^o^ 
PDB file m one filej 
and ^dtc ali 
the rm&mer 

another Sis 



Three -options to add rotamers* 
Do vdW check for aH of them 
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■Wnte tli© FPB fife f 



all the Input arid 
statistic i&mrmatioa iile 



Calculate the S AS-based 
energy for each conformed 
AG; Probability 
P^ttitiorvS^^ G/TO 



Ho. 



] Do vdw cheek for all of ilm S3 
21 ll P a ^ the complete SB; 
| c^nlbrmer . Pick up the best SS 
$ bond. 







j Po ■"**bisiding* T test for each ] 







Three options for atom pair Ym der Waals contact distances check. 

Ggslosi 4y normal distance; 2 t extreme distance; 3, calibrated distance. 

27^^ wken d&ittg vdw check. 
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A flow chart of the program MPMOD 
(the slow mod) 
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Input; pep ticte seqoeace, disulfide bond caaaecti^ky ped other j 
parameters j 



Randomly generate %\ £s> lis the allowed regions of the 
Ramach^ndran maps (G!y, Pro, CB -branched, md iM rest of 
the amisso &ek!)* Assign fee angles to each residue of the 
I backbone. 



302] 



Generate backbone ata^s C starting &o*n 



three ,gis?esv ktdnis. 




| Generate backbone atoms CB > 0 {BB m$ H£j& 
3 03 f optional). Do vdw c&eek with ail; other atoms 
I after £aqh atom is a<|decL 
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Add rofcamsrs (Shree optab&s) to each residue 
feel^mg CYSv 0o vdw check after each garner 



(a group of atom^ Is added (See separate page); 



Bad 



Continues t® nmipmge 



Fie, 1« 
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2 t Modeling disulfide 
beads record the SS pairs. 

3* £>o vdw e&j££k lor the SS 

conforoier 
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No 



Yes 



Record tfes msmfeer of 
conformed SS toa&. 
Calculate iM S&S^bmtA 
energy AG for each corafori&er- 



£k> all fee statistics (mo 



Record the swmber of 
conformer that cm not 
form SS bond. 

Link Corex program sad 
ea&ulate the 5 AS-based 
s^ergy AG for each 
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BOX B. OBSERVATIONS WHERE UNITY OF IN^ENTIOM IS lACWm& 

This apptarian cantatas the following inventions or groups of inventions which are not so jinked as to fam a single inventive 
concept under PCX RiM 13,1, In order for all Inventions to be searched, the appropriate additional search fees must be paid, 
Group l { dmms i-l5 f 48-33 and 55, dr^wn to a rampiite^^ protein with step of modeling 

disulfide bonds. 

Group II, claims 16-22, drawn to a mmputer-dss^ed method for use in modifying protein with step of determining d&uSfkfe 
bonds. 

Group HI, daims 23 - 26. 56 and 57, drawn to a computer assisted method fer use In modifying protein that cafelafc&s linear 
conformation & COR EX with the step of generating random conformational angetes to allowed region of Ramachandran maps. 
Group IVY claims 27-38, drawn to a metilbd tor ddterrninmg fee rate of tllsulM^ bond loop dosure in a pfofefe 
<5roup V ; claims 39 - 43, and 47 r drawn to a method for assessing the bsndsng affinity of a protein to a template molecule* 
Group VI, claims 44 - 46, and S4 t drawn fco a method % modifying p^qtg^ cr^s^l^ructyre or NMfc 

sokition structure, 

JUe inventions listed as Groups I - VI do not retete to a single inventive concept under PCI Rule 13.1 because, under PGT Rule 
13.2, they lackl^e same or corresponding special ^chr^csl features for the FoJIpwing r^asp^s; 

The special technical features far the Inven^pns of GrpMp lis a computer-as&sted method for use ln' : 'nk^i^n^--pfot^>w^ 
step of modeling disulfide bonds. 

The special technical, features for ifte inventions of Group II is a computer-ass^sted method for use m modifying protein with 
step of defeerm^fcQ disulfide bonds. 

The speda* fcechmca! ^atirres fcr the inventions of Group HI is a computBf-assisted method for use In modifying protein that 
calculates linear confermafei Is OOREX with the step of generating random conformational angeles in avowed reg&n of 
Ramachsndran maps. 

Hie special technicai features tor the inventions of Groups IV a ' method . % ^emiinmg -the rate, of : ^si#de • bond Soap dosure m 
a protein. 

The spedai technkBl features for the inventions of &roup V is a method for assessing the bsndlng afftnity of a protein to a 
template moiecuie. 

The speciai- tecr^ksl futures fof the irivention^ of Orcsip- VI & a mettiod^ X-ray crystal 

structure or NMR solution structure. 
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